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ABSTRACT: 

Let {or : r G N'^} be a d-dimensional array of numbers, for which the generating function 
T(z) := Orz'’ is meromorphic in a neighborhood of the origin. For example, -F may be a rational 
multivariate generating function. We discuss recent results that allow the effective computation of 
asymptotic expansions for the coefficients of F. 

Our purpose is to illustrate the use of these techniques on a variety of problems of combinatorial 
interest. The survey begins by summarizing previous work on the asymptotics of univariate and 
multivariate generating functions. Next we describe the Morse-theoretic underpinnings of some 
new asymptotic techniques. We then quote and summarize these results in such a way that only 
elementary analyses are needed to check hypotheses and carry out computations. 

The remainder of the survey focuses on combinatorial applications, such as enumeration of words 
with forbidden substrings, edges and cycles in graphs, polyominoes, and descents in permutations. 
After the individual examples, we discuss three broad classes of examples, namely functions de¬ 
rived via the transfer matrix method, those derived via the kernel method, and those derived via 
the method of Lagrange inversion. These methods have the property that generating functions de¬ 
rived from them are amenable to our asymptotic analyses, and we describe further machinery that 
facilitates computations for these classes of examples. 
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1. Introduction 


The purpose of this paper is to review recent developments in the asymptotics of multivariate 
generating functions, and to give an exposition of these that is accessible and centered around 
applications. The introductory section lays out notions of generating functions and their asymptotics, 
and delimits the scope of this survey. 

We employ the standard asymptotic notation: “/ = 0(g)” means limsup^^^^ \ f{z)/g{z)\ < oo, 
where the limit zq is specified but does not appear in the notation; “/ = o(g)” means f /g ^ 0 and 
“/ ~ 5” means f jg —> 1, again accompanied by a specification of which variable is bound and the 
limit to which it is taken. When we say “a„ = 0 ( 5 ( 71 ))” we always mean as n ^ 00 ; for multivariate 
arrays and functions, statements such as “ur = 0 ( 5 (r))” must be accompanied by a specification 
of how r is taken to infinity. We typically use r for (ri, ..., r^) G and z for (zi, ..., Zd) G C"^, 
but sometimes it is clearer to use (r, s, t) for (ri, r2, ra) or {x, y, z) for (zi, Z2, Z3). The norm |r| of a 
vector r denotes the norm \rj\. 


1.1. Background: the univariate case. Let {an : n > 0} be a sequence of complex numbers and 
let /(z) := J2n be the associated generating function. For example, if an is the Fibonacci 
number, then /(z) = 1/(1 — z — z^). Generating functions are among the most powerful tools in 
combinatorial enumeration. In the introductory section of his graduate text Sta97j| , Richard Stanley 
deems a generating function /(z) to be “the most useful but most difficult to understand method 
for evaluating” a„, compared to a recurrence, an asymptotic formula and a complicated explicit 
formula. 


There are two steps involved in using generating functions to evaluate a sequence: first, one must 
determine / from the combinatorial description of {a„}, and secondly one must be able to extract 
information about a„ from /. The first step is partly science and partly an art form. Certain 
recurrences for {a„} translate neatly into functional equations for / but there are numerous twists 
and variations. Linear recurrences with constant coefficients, such as in the Fibonacci example, 
always lead to rational generating functions, but often there is no way to tell in advance whether 
/ will have a sufficiently nice form to be useful. A good portion of many texts on enumeration is 
devoted to the battery of techniques available for producing the generating function /. 

The second step, namely estimation of a„ once / is known, is reasonably well understood and 
somewhat mechanized. Starting with Cauchy’s integral formula 

(1.1) a„ = ^ y z~'^~'^f{z)dz, 

one may apply complex analytic methods to obtain good estimates for an- As a prelude to the 
multivariate results which are the main subject of this survey, we will give a quick primer on the 
univariate case (Section [2] below), covering several well known ways to turn (11.111 into an estimate 
for an when n is large. 
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1.2. Multivariate asymptotics. In 1974, when Bender published the review article Ben74| . the 
extraction of asymptotics from multivariate generating functions was largely absent from the liter¬ 
ature. Bender’s concluding section urges research in this area: 


Practically nothing is known about asymptotics for recursions in two variables even 
when a generating function is available. Techniques for obtaining asymptotics from 
bivariate generating functions would be quite useful. 


By the time of Odlyzko’s 1995 survey [Odl95l| . a single vein of research had appeared, initiated by 
Bender and carried further by Gao, Richmond and others. Let the number of variables be denoted 
by d so that z denotes the d-tuple (zi,..., Zd)- We use the multi-index notation for products: 
z'’ := Suppose we have a multivariate array {or : r S N"’*} for which the generating 

function 


F{z) — ^ OrZ'’ 

r 

is assumed to be known in some reasonable form. We are interested in the asymptotic behavior of 
Or. For concreteness, one may keep in mind the following examples, which are discussed in detail in 
Sections 14.11 and 14.21 


Example 1.1 (binomial coefficients). Let Urs = (V"/)- The array {ars\ is Pascal’s triangle, oriented 
so that the rays of ones emanate from the origin along the positive r and s axes. From the recursion 
Urs = dr-i.s + o-r,s-i, holding whenever (r, s) ^ (0,0), one easily obtains 


F{x,y) 


E r s 

arsX y 

r,s>0 


1 

l-x-y' 


Example 1.2 (Delannoy numbers). Let Ors count the number of paths from the origin to the lattice 
point (r, s) made of three kinds of steps: 1 unit east, 1 unit north, and \/2 units northeast. These 
are called Delannoy numbers. The recursion ar,s = ctr-i,s + ctr,s-i + ctr-i,s-i leads immediately 
to 


F{x,y) 


OrsX^'y’’ 

r,s>0 


1 

1 — X — y — xy 


In both of these examples, E is a rational function. In one variable, obtaining asymptotics for 
the coefficients of a rational function is a quick exercise (see (EU in Section E] below) but in more 
than one variable this is far from true. For example, whereas the binomial coefficients are exact 
expressions easily approximated via Stirling’s formula, the Delannoy numbers do not have such a 
simple expansion (the expression (1) readily derived from the generating function, 

but no simpler one is forthcoming). 


In [Sta99l Example 6.3.8], it is shown how to get more information about the central Delannoy 
numbers a„„ by finding the generating function Xn ^nnz’"'. Uniform estimation of the general term 
Ors is not possible by this diagonal method (see Section [5?^ . though the problem succumbs easily 
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to multivariate methods (see (ll.4p below). The class of multivariate generating functions addressed 
in this survey is larger than the rational functions; the exact hypothesis is that the function be 
meromorphic in a certain domain which is spelled out in the remark following Theorem 13.161 

T he first paper to concentrate on extracting asymptotics from multivariate generatin g funct ions 


Ben73| . already published at the time of Bender’s survey, but the seminal paper is [BR83|. In 


was 

this paper, Bender and Richmond assume that F has a singularity of the form A/(zd — near 

the graph of a smooth function g, for some real exponent g, where x denotes (zi,... ,Zd-i)- They 
show, under appropriate further hypotheses on F, that the probability measure /j,„ one obtains by 
renormalizing {ur : = n} to sum to 1 converges to a multivariate normal when appropriately 

rescaled. Their method, which we call the GF-sequence method, is to break the d-dimensional 
array {ur} into a sequence of (d — l)-dimensional slices and consider the sequence of (d — l)-variate 
generating functions 

/n(x) = ^ UrZ'’- 

v.rd—n 

They show that, asymptotically as n ^ oo, 

(1.2) /„(x) ~ Cng{x)h{x.)"' 

uniformly over x in a certain ball, and that sequences of generating functions obeying m satisfy 
a central limit theorem and a local central limit theorem. 

We will review these results in more detail in Section [H but one crucial feature is that they always 
produce Gaussian (central limit) behavior. The applicability of the entire method is therefore limited 
to the single, thoug h impo rtant, case where the coefficients Or are nonnegative and possess a Gaussian 
limit. The work of [bR. 83I | has been greatly expand ed upon , but always in a similar framework. For 
example, it has been extended to matrix recursions BBW83| and the applicability has been extended 
from a lgebrai c to algebraico-logarithmic singularities of the form F ^ {zd — g(x))^ log“(l/(zd — 
g(x))) [GR93- The difficult step is always deducing asymptotics from the hypotheses (11.21) . Thus 
some papers in this stream refer to such an assumption in their titles, and the term “quasi-power” 
has been coined for such a sequence {fn}- 

The theory has also bee n pu shed forward via its use in applications. The forthcoming textbook 
by Flajolet and Sedgewick |FSl | devotes a chapter of nearly 100 pages to multivariate asymptotics, 
in which many of the basic results on quasi-powers are reviewed and extended. The reader is 
referred to this substantial body of work for an extensive collection of applications that can be 
ha ndle d by reducing multivariate problems to univariate contour integrals. The limit theorems 


FSl . Ghapter IX] (outside of th e graph enumera tion example in Section 11) are all Gaussian. The 


large deviation results of Hwang [Hwa96t iHwaQSlj are not restricted to the Gaussian case, but give 
asymptotics on a cruder scale. 


1.3. New multivariate methods. Odlyzko’s survey of asymptotic enumeration methods [Odl9f 
which is meant to be somewhat encyclopedic, devotes fewer than 6 of its 160 pages to multivariate 
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asymptotics. Odlyzko describes why he believes multivariate coefficient estimation to be difficult. 
First, the singularities are no longer isolated, but form (d — l)-dimensional hypersurfaces. Thus, 
he points out, “Even rational multivariate functions are not easy to deal with.” Secondly, the 
multivariate an alogue of the one-dimensional residue theorem is the considerably more difhcult 
theory of Leray Ler59l | . This theory was later fleshed out by Aizenberg and Yuzhakov, who spend a 
few pages AYS.ll . Section 23] on generating functions and combinatorial sums. Further progress in 
using mult ivariate residues to evaluate coefficients of generating functions was made by Bertozzi and 
McKenna BM93|, though at t he time of Odlyzko’s survey none of the papers based on multivariate 


residues such as 
to enumeration. 


Lic91 


BM93t had resulted in any kind of systematic application of these methods 


The top i c of t he pre sent review article is a recent vein of research begun in PW02l| and continued 


PW04 


BP 


Lla03l | and in several manuscripts in progress. The idea, seen already to some 


degree in BM93| , is to use complex methods that are genuinely multivariate to evaluate coefficients 
of multivariate generating functions via the multivariate Cauchy formula. By avoiding symmetry¬ 
breaking decompositions such as F{z) = fn{zi, ■ ■ •, Zd-i)z^, one hopes that the methods will be 
more universally applicable and the formulae more canonical. In particular, the results of Bender et 
al. and the results of Bertozzi and McKenna are seen to be two instances of a more general result 
that estimates the Cauchy integral via topological reductions of the cycle of integration. These 
topological reductions, while not fully automatic, are algorithmicall y decidable in l arge classes of 
cases and are the subject of Section [3l An ultimate goal, stated in [PW02I : IPWOdjj . is to develop 
computer software to automate all of the computation. 


Aside from providing a summary and explication of this line of r esearch , the present survey is 
meant to serve several other purposes. First, results from PW02t IPWOdt iBPj j are presented in 
streamlined forms, stated so as to avoid the scaffolding one needs to prove them. This is to make 
the results more comprehensible. Secondly, by focusing on combinatorial applications, we hope to 
create a sort of user’s manual: one that contains worked examples akin to those a potential user 
will have in mind. Many of the applications are to abstract combinatorial structures but we also 
include direct applications to computational biology and to formal languages and automata theory. 
Finally, we present a number of results that “pre-process” the basic, general theorems, providing 
useful computational reductions of the hypotheses or conclusions in specific cases of interest. We 
now give the notation for generating functions and their asymptotics that will be used throughout, 
and then briefly describe a prototypical asymptotic theorem. 


Throughout this survey, we let 


F(z) 


G{z) 

H{z) 


OrZ'' 


(1.3) 
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be a generating function in d variables, where G and H are analytic and H{0) ^ 0. Recall that in 
the bivariate case (d = 2) we write 


F{x,y) 


G{x,y) 

H{x,y) 


OO 

^ arsX^'y''. 
r,s—0 


The representation of T’ as a quotient of analytic functions is required only to hold in a certain 
domain, described in the remark following Theorem 13.161 though in the majority of the examples 
F is meromorphic on all of C^. We will assume throughout that FI vanishes somewhere, since the 
methods in this paper do not give nontrivial results for entire functions. 


We are concerned with asymptotics when |r| ^ oo with r := r/|r| remaining in some specified 
set, bounded away from the coordinate planes. Thus for example, when d = 2, the ratio s/r will 
remain in a compact subset of (0,oo). It is po ssible via our methods to address the other case, 
where r = o(s) or s = o(r), (see, for example, [Lla03l| ). but our main purpose in this pap er is to 
give examples that require, among all the methods an d results cited above, only those from PW02 | 
together with the simplest methods from 


pwn4 


BP|. 


To illustrate what sort of basic results we quote from PW02I : |PW04| . we quote the following 
combination of Corollarv l3.18l and Theorem 13.201 from Section [31 


Theorem 1.3. Let F he as in im and suppose Or > 0. 


(i) For each r in the positive orthant, there is a unique z(r) in the positive orthant satisfying 
the equations rdZjdFI/dzj = rjZddF[/dzd (1 < J < d — 1) from US.5\) below, and lying on the 
boundary of the domain of convergence of F; the quantity z(r) depends on r only through 
the direction r/|r|. 

(ii) With z(r) defined in this way, if G{z{r)) ^ 0, 


( 2 ^)-(<^- 1 )/ 27 ^- 1/2 


G(z(r)) 


-ZddH/dzd{zir)) 


(rd) 




(r)' 


uniformly over compact cones of r for which z(r) is a smooth point of {H = 0} uniquely 
solving (ESP on the boundary of the domain of convergence of F, and for whichTi. is nonzero, 
where Ti. is the determinant of the Hessian matrix of the function parametrizing the hyper¬ 
surface {H = 0} in logarithmic coordinates. 


Going back to Example 1 1.11 let us see what this says about binomial coefficients. It will turn out 
(see Section l4T|l that z(r) = (y^,y^). Evaluating the Hessian and the partial derivatives then 
leads to equation 63 ) below: 

/r + s\^/r + sV /r + s 
\ 1 " ) \ s ) \ 2'Krs 
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as r, s ^ oo at comparable rates. This agrees, of course, with Stirling’s formula. When we try this 
with the Delannoy numbers from Example 11.21 we find that 


z(r,s) = 


_|_ g2 _ g ^^2 _|_ g2 _ , 


and that 
(1.4) 


_ , 


\/r2 + s2 _ . 


Stt V (r + s - 


uniformly as r, s ^ oo with r/s and s/r remaining bounded. Setting r = s = n gives the estimate 

I i 0 — 1/4 

V 27rn 2 - 

for the central Delannoy numbers. This estimate can also be computed, with more effort, by using the 
methods of Section lOl applie d to th e diagonal generating function, which can itself be derived using 
the diagonal method as in Sta99l Example 6.3.8] (we discuss the diagonal method in Section 15?^ . 


In the sections to follow, we work through a number of applications of this and other, newer 
theorems to problems in combinatorial enumeration. While doing so, we develop companion results 
that simplify the computations in certain classes of examples. A prototypical companion result is 
the following result for implicitly defined generating functions, which arise commonly in recursions 
on trees. 


If f{^) = ^4>{f{^)) then the coefficient of f is given asymptotically by 


(1.5) 


7-3/2. 


yo^'iyoY' 


y/2TT(j)"{yo)/(l}iyo) 

where yo is the unique positive solution to y4>'{y) = 4>{y)- 


In FSjj the computation is reduced via Lagrange inversion to the determination of the coefficient 
of which they then carry out via complex integration methods. By viewing this coefficient 

instead as the (n, n) coefficient of 

Fix,y) := - -, 

1 - x(p(y) 

one sees that the complex integration step is already done, and that m is immediate upon iden¬ 
tifying that z{n,n) = {l/(j){yo),yo)-, see Proposition 16.11 where the hypotheses for (II.5p are stated 
more completely. 

The organization of the remainder of this paper is as follows. The next section gives the promised 
quick primer in univariate methods. Section [3] is the most theoretical section, outlining results from 
various sources to give a brief but nearly complete explanation as to how one goes from , via a 
multivariable Cau chy form ula, t o asym ptotic formulae for a^. We then quote the precise theorems 
we will use from PW02l| and PWOdj j. While we attempt here to provide short and accessible 


statements of results, readers interested only in knowing enough to handle a particular application 
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may wish to skip this section, find the closest match among the examples in Sections |4] - [3 and 
then refer back to the necessary parts of Section [3l Section |4] works in detail through a number of 
diverse examples. The subsequent three sections discuss collections of examples arising from three 
combinatorial methods: transfer matrices, Lagrange inversion and the kernel method, respectively; 
we present applications to enumerating various kinds of words, paths, trees and graphs. In Section[8] 
we discuss open questions and extensions of the material presented here, and compare our results 
with those of other authors. 


2. A BRIEF REVIEW OF UNIVARIATE METHODS 


Since our main subject is the asymptotics of coefficients of multivariate generating functions, we 
will give only a quick overview of the univariate case touching on three widely used methods, namely 
saddle point analysis, Darboux’ circle method and branch point contours. In each case we will give 
a brief summary and an application or two, as well as some pointers to the literature. For readers 
who wish to understand univariate asymptotic method s in greater detail, we recommend beginning 
with Sections 10-12 of Odlyzko’s survey paper Odl95| . 


We begin by disposing of a trivial case. If f{z) is a rational function, written as P{z)/Q{z) for 
polynomials P and Q, then a partial fraction decomposition exists: 


( 2 . 1 ) 


/w = E 


Pa , fc (2:) 

(1 — zja)^ 


where a ranges over roots of Q and the integer k is between 1 and the multiplicity of the root a. 


For j > i, the z^ coefficient of 


IS a 




j - i + k 


(1 — zja)^ k 

a„ = y^pa{n)c 


, which leads to a representation 


for easily computed polynomials Pa ■ 


We turn now to analyses via contour integrals. To see what underlies all of these methods, we 
begin with a preliminary estimate. Suppose that / has radius of convergence R. Taking the contour 
of integration in dD to be a circle of radius R — e gives an = 0{R — e) " for any 0 < e < i?. 
If / is continuous on the closed disk of radius R then a„ = 0{R~'^). All the methods we discuss 
refine these basic estimates by pushing the contour out far enough so that the resulting upper bound 
becomes asymptotically sharp. 


2.1. Saddle point methods. We begin with a worked example, then comment on the method in 
general. 
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Example 2.1 (set partitions). Let an be the number of (un ordered) partitions of {1,... ,n} into 
ordered sets. This has exponential generating function \R.io6R . page 194] 


/(»> - S = 


n—0 


z/(l-2 


To evaluate the Cauchy integral, we attempt to move the contour so it passes through a point where 
the integrand is not rapidly oscillating. Specifically, we let In = f(z)) denote the logarithm 

of the integrand and we find where I'n vanishes. It is not hard to see that 

I' = ^ ^ 


Z (1-Z)2 

vanishes at a value 1 —/3„ where /3„ = Expand the contour to a circle passing through 

1 — fn, or equivalently and more cleanly, a piece of the line 1 — fn + ix near a: = 0. Replacing the 
integrand exp(J„(z)) dz by its two-term Taylor approximation, one obtains 

-I /«nn 

an 1 


n! 27rz j 

^ exp(/„(l - Pn)) 
27T 

_ exp(J„(l - fn)) 
27r 


exp(/„(l - (3n + dt) 

J “ Pn){itf^ dt 


2^/"(l - pn) ■ 


The approximation may be justified by routine estimates. Plugging in the value of 1 — fin into I and 
shows the estimate to be asymptotically 


(1 + 0 ( 1 )) 


1 


47ren^/^ 


exp{2\4n). 


For this method to work either / must be entire or the saddle point (where l!^ vanishes) must be 
in the interior of the domain of convergence of /. These conditions are frequently satisfied and the 
method is widely applicable. We do not know when saddle point app roximat ion was first used in the 
context of generating functions. Hayman’s seminal paper from 1956 Hav56l| defines a broad class of 
functions, called admissible functions for which the saddle point method can be shown to work 
and the Gaussian approximation mechanized. A variation of the generating function exp(z/(l — z)) 
is analyzed in Hayman’s Theorem XIII. 


2.2. Circle methods. Suppose / has a positive, finite radius of convergence, i?. If there is no saddle 
point of z“"'“^/(z) in the open disk of radius R, one might try to push the contour of integration 
near or onto the circle of radius R. Darboux’ method is essentially this, with the refinement that if 
/ extends to the circle of radius R and is k times continuously differentiable there, then 

(2.2) j z-^f{z) dz = 0 . 

This follows from integration by parts. 
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The following very old theorem may be found, among other places, in HenQlL Theorem 11.10b: 
“Theorem of Darboux”]. 

Theorem 2.2 (Darboux). Let f{z) = = (r — z)°‘L{z) where r > 0, a is not a positive 

integer, and L is analytic in a disk of radius greater than r. Then 

.-1 L{r) 
r{-a) ■ 

Example 2.3 ( 2-regula r graphs). Let bn be the number of 2-regular graphs. The exponential gen¬ 
erating function fWiWA . equation (3.9.1)] is given by 


n 

Letting a = —1/2, r = 1 and L{z) = e-^/2-2^/4 


^n. ^ ^ 




gives 

bn e-3/4 


n\ 


To prove the above version of Darboux’ Theorem, write L{z) as m terms of a Taylor series about 
r plus a remainder. This expresses f{z) as 

m —1 

Ck{r - zr+^ + O {{r - zr+^) . 

k=0 

The known asymptotics for coefficients of (r — zY together with the estimate (12.21) is good enough 
to yield m — 1 terms in the conclusion provided one takes to > 1 + (Re{—a})"'", where x“*" denotes 
the maximum of x and 0. □ 

There are a great many variations on this, depending on how badly / behaves near the cir¬ 
cle of ra dius r. One of the most classical fruits of the circle method is Hardy and Ramanujan’s 
estimate 


HR17| of the partition numbers. 


Example 2.4. Letpn denote the number of partitions ofn, that is, representations of n as a sum of 
positive integers with the summands written in descending order. The number of identities involving 
these numbers and their relatives is staggering (see, e.g., lAEOl I for an elementary survey). Euler 
observed that the generating function may be written as 


Kz) ■= ^ 


PnZ = 


n—0 


n 

fe =0 


1 — z^ 


From this, Hardy and Ramanujan obtained 


Pn 


exp(7r-\/2n/3) 

4V^n 


as n 


oo. 
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2.3. Transfer theorems. A closer look at the proof of Darboux’ Theorem shows that one can do 
better. Analyticity of //(r — 0 )“ beyond the disk of radius r is used only to provide a series for / in 
decreasing powers of (r — z). On the other hand, taking the contour to be a circle of radius r loses 
one power of n in the estimate, though this doesn’t matter when the expansion of / has sufficiently 
many terms. 

In light of these observations, one sees that choosing a custom-built contour can simultaneously 
weaken the analyticity assumption while strengthening the estimate. The choice of contour will 
depend on the nature of the singularity of / on the boundary of its domain of convergence, but in 
many instances a good choice is known. Both a non-integral power of (r — z) and a logarithm or 
iterated logarithm of l/(r — z) are branch singularities and require similar contours. In the simplest 
case, the contour, pictured in Figure [H consists of an arc of a circle of radius 1/n around r, an arc 
of a circle of radius r — e centered at the origin, and two line segments connecting corresponding 
endpoints of the two circular arcs. 




Figure 1. contour for an algebraic singularity and the corresponding region of analyticity 
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Among the many refinements of Dar boux’ T heorem via such contours are the transfer theorems 
of Flajolet and Odlyzko. Their paper F09nl | is masterfully written and worth taking a couple of 
pages here to summarize. They begin by isolating the gist of (|2.2I) . namely that = 0(n“ if 
f{z) = 0(1 — 2 ;)“. Already this allows for / not to be in any nice class of functions, as long as it 
is bounded by a class we understand. Next, they use the above contour to improve this to a sharp 
estimate. 


(2.3) 


fiz)=0{l-zY 


= 0(n-“-^) 


Let fialg-log denote the class of functions that are a product of a power of r — z, a power of log(l/(r — 
z)) and a power of loglog(l/(r — z)). Flajolet and Odlyzko estimate an for any function in fialg-log. 
Their last crucial observation is that the implication in (12.3p gives an explicit constant for the bound 
on Qn based on the constant in the bound f{z) = 0(1 — z)“, which is sufficiently constructive to 
give 

f{z) = o({r-z)°') ^ an = . 

The surprisingly strong consequence of this is that any / with an asymptotic expansion by functions 
{gk} in fialg-log has coefficients asymptotically given by summing the asymptotics of the coefficients 
of the functions gk- 

The main result of |F09nt begins by stating what analyticity assumption is necessary in order to 
use the contour in Figure [TJ Given a positive real R and e G (0,7r/2), the so-called Camembert- 
shaped region, 

{z : |z| < i? -I- e, z ^ i?, I arg(z - R)\ > 7r/2 - e} , 
shown on the right of Figure [TJ is defined so that it includes the contour in Figure [TJ 


Theorem 2.5. Let f{z) = 'Y20‘nz'^ be singular at R but analytic in a Camembert-shaped region. For 
g{z) = YY bnz" in the class fialg-log, asymptotic relations between f and g as z ^ R imply estimates 
on the coefficients as follows. 


(i) 

(ii) 

(iii) 


/(z) = 0{g{z)) ^ an = 0{bn ); 
/(z) = o(g(z)) ^ a„ = o{bn ); 
/(z) ~ g{z) ^ On^bn- 


In particular, when /(z) ^ C(r — z)“, 


this result subsumes Theorem \2.2[ 


We have indicated the arguments deriving the second and third part of the theorem from the 
first. We now sketch, in t he sim plest case where g{z) = (1 — z)“, the upper estimate on a„, which 
appears as Corollary 2 of F09d |. The integral over the large circular arc is small because z“"“^ is 
exponentially small there. The integral over the small circular arc is 0(n“““^) because the integrand 
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is 0{n~°‘) and the arc has length 0{n~^). On the two radial line segments, the modulus of z~^~^ 
is small on the line segments except in a neighborhood of size 0(l/n) of 1. Here, changing variables 
to {z — l)/n, one has an integral of the f orm f A (z) exp(n</)(z)) dz from which one obtains (e.g., via 
the well known Watson-Doetsch lemma 

(2.4) 


Hen91 


H(l) 


Theorem 11.5]) a value 

^ —a —1 


r(-a) 


2 n 


□ 


be the Catalan number. A great many 


Example 2.6 (Catalan numbers). Let an '■= - 

n + 1 \ n 

na turally occurring combinatorial classes are counted by this sequence: there is a list of 66 of these 
in fStaQtA . Problem 6.19]. The generating function for the Catalan numbers is 




1 - VI-4z 


l-2j\-z 


n—0 


2z 2z 

There is an algebraic singularity at r = 1/4, near which the asymptotic expansion for f begins 
f{z) = 2 - 4^1^ + 8(i - z) - 16(i - zf/^ + 0(i - z)^ . 


Note that ff^lfA — z is not analytic in any disk of radius 1/4 + e, since both integral and half¬ 
integral powers appear in f, but f is analytic in a Camembert-shaped region. Theorem \2.5\ thus gives 
(note that the integral powers of (1 — z) do not contribute): 

l/2-n ^ 3/2-n 

'r(-l/ 2 ) + V4j r(-3/2) 

.-5/2(-16)(i)^/^ 


-3/2. 


+ 0(n-^/2) 


= 4”n 


= 4" 


-3/2 ( 4)(4)^/^ ' 


r(-i/ 2 ) 


r(-3/2) 


-f 


-3/2 


_n-5/2^ + 0(n-"/2) 

2 V TT 


We have given only a brief view of what is known in the univariate case. We close this section 
with several more pointers to the literature. 

One review of univa riate asymptotics, that predates the work of Flajole t and Odlyzko but is still 
quite useful, is [Ben74 part II]. The 1995 survey article by Odlyzko |Odl95l] to which we have already 
referred is somewhat more extensive. As mentioned before, contours such as the one in Figure [T] 
occur primarily in the univariate literatur e, but o ne recent extension to the multivariate setting, 
via a product of these contours, occurs in [GWOO|: their Lemma 3 gives a Darboux-type estimate 
for the (n, fci,..., fcj)-coefficient of a generating function asymptotic to (1 — Zi)““ ~ 

uniform as long as ki = 0(n) for all i. Finally, the book FS| is the most up to date, though it is 
only available in electronic preprint form at this time. 
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3. New multivariate results 


In [PW02t lPW04 IBP| the authors derived asymptotic formulae for Or, as |r| ^ oo, that are 
uniform as r/|r| varies over some compact set. It is useful to separate r into the scale parameter |r|, 
which is a positive real number, and a direction parameter r, which is an element of real projective 
space. Although r is always an element of the positive orthant of it will sometimes make sense 
to consider it as an element of When convenient, we identify r with its class f in projective 
space or its projection f in the real {d — l)-simplex the set {x S (K.+ )'^ : |x| = 1} (where 

|x| ^j)- Thus r may appear anywhere in the following diagram, where O and O denote the 

positive orthants of and respectively. 


O 


O 


A<i- 


1 




CP' 


1-1 


Quantities that depend on r only through its direction will be denoted as functions of r. The 
results we will quote in this section may be informally summarized as follows. Recall that F = G/F[. 

(i) Asymptotics in the direction f are determined by the geometry of the pole variety V = {z : 

= 0} near a finite set, Bcritr, of critical points [Definition 13.3j . 

(ii) For the purposes of asymptotic computation, one may reduce this set of critical points 
further to a set Bcontribjr C Bcritr of contributing critical points, usually a single point 
[Formula (13.4|) and Definition 13. 4| . 

(iii) One may determine Bcritf and Bcontrib^ by a combination of algebraic and geometric criteria 
[Proposition 13.111 and Theorem 13.151 respectively]. 

(iv) Critical points may be of three types: smooth, multiple or bad [Definition 13.9j . 

(v) Corresponding to each smooth or multiple critical point, z, is an asymptotic expansion for 
Or which is computable in terms of the derivatives of G and H at z [Sections 13.31 and 13.41 
respectively]. 

The culmination of the above is the following meta-formula: 

(3.1) Or ~ formula(z) 

zeBcontribr 

where formula(z) is one function of the local geometry for smooth points and a different function 
for multiple points. Specific instances of en) are given in equations (13.61) - (13.131) ; the simplest case 
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is 

Or ~ C\y\~^ Z”'' 

where C and z are functions of f. No general expression for formula (z) is yet known when z is a 
bad point, hence the name “bad point”. 


(3.2) 


Fundamental to all the derivations is the Cauchy integral representation 

1 f 


^F"(z) dz 


(2T:iY Jt 

where T is the product of sufficiently small circles around the origin in each of the coordinates, 1 
is the d-vec tor of a l l ones, and dz is the holomorphic volume form dzi A • • • A dza- The gist of the 
analyses in 


PW02; 


PW04 IBP|| is the computation of this integral. The key step is to replace the 


cycle T by a product Ti x T 2 , where the inner integral over Ti is a multivariate residue and the 
outer integral over T 2 is a saddle point integral. A complete asymptotic series may then be read 
off in a straightforward manner by well known methods. The first subsection below explains what 
critical points and cont ribu ting critical points are and gives the “big picture”, namely th e topolo gical 
context as outlined in BPj| . The second subsection gives the specific definitions from PW02l | that 


we need to find the contributing critical points (note: the question of sorting these into smooth, 
multiple or b ad criti cal po ints is a ddressed at the beginning of Section [4]). Sections [313] and [33] quote 


results from PW02l | and PW04| that give asymptotics for smooth points if they are respectively 
smooth or multiple. Section 13.51 restates some of these asymptotics in terms of probability limit 
theorems. 


3.1. Topological representation. As r ^ 00 , the integrand in (|3.2p becomes large. It is natural 
to attempt a saddle point analysis. That is, we try to deform the contour, T, so as to minimize 
the maximum modulus of the integrand. If f remains fixed, then the modulus of the integrand is 
well approximated by the exponential term exp (— |r|(f • log |z|)), where log |z| is shorthand for the 
vector (log jzi |,..., log \zd\)- This suggests that the real function 

(3.3) h(z) :=-f • log|z| 


be thought of as a height function in the Morse theoretic sense, an d that w e try to deform T so 
that its maximum height is as low as possible. Stratified Morse Theory GM88| solves the problem of 
accompl ishin g this optimal deformation. We now give a brief summary of this solution; for details, 
consult 


BP|. 


The variety V may be given a Whitney stratification. If V is already a manifold (which is the 
generic case), there is just a single stratum, but in general there may be any finite number of strata, 
even in two dimensions. The set of smooth points of V always constitutes the top stratum, with the 
set of singular points decomposing into the remaining strata, each stratum being a smooth manifold 
of lower complex dimension. To illustr ate som e of the possibilities, we give two examples; for further 
details on Whitney stratification, see [GMSSl Section 1.1.2]. 
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Example 3.1 (complete normal intersection). Let V be the variety where (1 — a::)(l — i/)(l — z) 
vanishes. The smooth points are those where exactly one of x,y or z is equal to 1. There are three 
one-dimensional strata where two but not all ofl — x, 1 — y and 1 — z vanish, and one zero-dimensional 
stratum at (1,1,1). 

Example 3.2 (isolated singular point). Let V be the variety {1 — x — y — z-\- Axyz = 0}. This 
has a zero-dimensional stratum at its unique singularity, (|, i, i); everything else constitutes the 
two-dimensional stratum. 

In each stratum, there is a finite set of critical points, namely points where the gradient of the 
height function restricted to the stratum vanishes. Since the height function depends on r via f, the 
set of critical points may be defined as a function of f or equivalently of r: 

Definition 3.3 (critical points). If S is a stratum ofV, define the set flcrit^[S) of critical points of 
S for H in direction r to be the set of z G S for which V/i|s(z) = 0. The set ficrit^ of all critical 
points of V is defined as the union of flcritp{S) as S varies over all strata. 


If S has dimension k then the condition V/i|s = 0 defines an analytic variety of co-dimension k. 
Generically, then, 6critf(S') is zero-dimensional, that is, finite. It does happen sometimes that for a 
small set of values of f the critical set is not finite; for example this happens when V is a binomial 
variety {z : z® — z*^ = 0}, in which case V/i|s vanishes everywhere on S for a particular r and 
nowhere on S for other values of r. We address such a case in Section li.lll Aside from that, such a 
degeneracy does not occur among our examples and a standing assumption (13.61 below) will ensure 
that it does not cause trouble in any case. 


If E is a rational function, the n the crit ical points are a finite union of zero-dimensional varieties, 
so elimination theory (see, e.g., CLO05L Chapters 1 and 2]) will provide, in an automatic way, 


minimal polynomials for the critical points; see ProDOsition l3. Ill below. So as to refer to them later, 
we enumerate the critical points as {z^^^, • • ■, 


Let A4 denote the domain of holomorphy of the integrand in (13.21) . that is, 

f 

M:=C‘^\<z:H-Y[zj=Q 

[ 

The topology of A4 is determined by neighborhoods of the critical points of V (the complement of 
A4) in a manner we now describe. Given a real number c, let denote the set of points of Ai 
with height less than c. Let AA'^ denote the AA^^ for some c greater than the greatest height of any 
critical point and let AA~ denote AA'^ for some c less than the least height of any critical point. 
If the interval [c, c'] does not contain a critical value of the height function, then is a strong 
deformation retract of , so the particular choices of c above do not matter to the topology and 
hence to the evaluation of the integral (13.21) . 
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Let X denote the topological pair {M'^,M~). The homology group Hd{X) is generated by the 
homology groups ,A4^ ) as [c', c"\ ranges over a finite set of intervals whose union contains 

all critical values. These groups in turn are generated by quasi-local cycles. At any critical point 
in the top stratum, the quasi-local cycle is the Cartesian product of a patch Vi diffeomorphic 
to inside V, and whose maximum height is achieved at with an arbitrarily small circle 7 ^ 
transverse to V. 


Figure 2. a piece of a quasi-local cycle at top-dimensional critical point 


Thus for example when d = 2, the quasi-local cycles near smooth points look like pieces of 
macaroni: a product of a circle with a rainbow shaped arc whose peak is at z^*^. We have not 
described the quasi-local cycles f or no n- maxima l strata, but for a description of the quasi-local 
cycles in general, one may look in BP| or GM 88 


Since the quasi-local cycles generate Hd{M^,M ), it follows that the integral (13.2p is equal to 
an integer linear combination 

(3.4) [ z~“’~^F(z) dz 

"z Ci 

where Ci is a quasi-local cycle near z and T = '^riiCi in ^ M.~). When z is in the top 

stratum, i.e., z is a smooth point of V, then Ci = Vi x 7 ^, the product of a d-patch with a transverse 
circle, so the integral may be written as exp[— |r|/i(z)]_F(z) dz. The inner integral is a simple 

'^For generic r, the function h is Morse. Each attachment map is a d-dimensional complex, so the attachments 
induce injections on HdiX). It may happen for some r that h is not Morse, but in this case one may understand 
the local topology via a small generic perturbation. Two or more critical points may merge, but the fact that the 
attachments induce injections on Hd{X) implies that such a merger produces a direct sum in Hd{X)-, in particular, it 
will be useful later to know that a cycle in the merger is nonzero if and only if at least one component was nonzero. 
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residue and the outer integral is a standard saddle point integral. The asymptotic evaluation of Or 
is therefore solved if we can compute the integers rn in the decomposition of T into riiCi. Observe 
that the contribution from is of exponential order no greater than exp(/i(z*^*))). It turns out 
(Theorem 13.191 and the formulae of Section 13.41) that for smooth and multiple points, this is a lower 
bound on the exponential order as well, provided, in the multiple point case, that G(z(*^) ^ 0. Thus 
if Ui 7 ^ 0, one may ignore any contributions from z^-l^ with and still obtain an 

asymptotic expansion containing all terms not exponentially smaller than the leading term. 

Definition 3.4 (contributing critical points). The set flcontrib = ficontribp of contributing critical 
points is defined to be the set o/z^'^ such that m 0 and nj = 0 for all j with h(z^^^) > h(z^'‘^). In 
other words, the contributing critical point(s) are just the highest ones with nonzero coefficients, Ui. 
Note that if there is more than one contributing critical point, all must have the same height. 


The problem of computing the topological decomposition may in general be difficult and no 
complete solution is known. In two dimensions, in the case where V is globally smooth, an effective 
algorithm is known for finding the contributing critical points. The algorithm follows approximate 
steepest descent paths; the details are not yet published vdHPj| . In any dimension, if the variety V 
is the union of h yperpla nes, it is shown in BP| how to evaluate all the np, see also some preliminary 
work on this in BM93(. In the case where V is not smooth, while no general solution is known, we 
may still state a sufficient condition for the critical point z^*) to be a contributing critical point. It 
is to these geometric sufficient conditions that we turn next. 


3.2. Geometric criteria. Let F,G,H and Or be as in (11.31) . If Or > 0 for all r, we say we are in 
the combinatorial case. We assume throughout that G and H are relatively prime in the ring of 
analytic functions on the domain of convergence of F. We wish to compute the function ficontrib 
which maps directions in O to finite subsets of V (often singletons). The importance of minimal 
points, defined below, is that when ficontrib^ contains a minimal point, this point may be identified 
by a variational principle. 


Definition 3.5 (notation for polydisks and tori). Let D(z) and T(z) denote respectively the polydisk 
and torus defined by 

D(z) := {z' : \z[\ < \zi\ for all I < i < d} ; 

T(z) := {z' : \z[\ = \zi\ for all I < i < d} . 


The open domain of convergence of F is denoted T) and is the union of tori T(z). The logarithmic 
domain of conv ergence, namely those x S with (e“L ■ ■ ■ j e^”^) € ^7 *5 denoted logi? and is always 


convex 


H6r9w . The image {log |z| : z G V} of V und er the co ordinatewise log-modulus map is 


denoted logV (this is sometimes called the amoeba ofV iGKZOA l). 


Standing assnmption 3.6. We assume throughout that logD is strictly convex, that is, its boundary 
contains no line segment. 
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This ensures that is always uniquely minimized on dlogD. The assumption is satisfied in all 
examples in this paper. It is a consequence of log V containing no line segments, which is true in all 
examples outside of Section 14.111 This may be checked by computer algebra; we do not include the 
checking of this fact in our worked examples. 

Definition 3.7 (minimality). A point z G V is minimal if all coordinates are nonzero and the 
relative interior of the associated polydisk contains no element of V, that is, V H D(z) C i9D(z). 
More explicitly, zt 0 for all i and there is no z' G V with |z'| < \zi\ for all i. The minimal point z 
is said to be strictly minimal if it is the only point ofV in the closed polydisk: V H D(z) = {z}. 


Re mark. This definition is equivalent to the apparently stronger definition of minimality stated 
in PWOij, namely that V n D(z) C T(z) - see Provosition \S.lS\ 

Proposition 3.8 (minimal points in Bcontrib). Recall the standing assumvtion \3.6\ 


(i) A point z gV with nonzero coordinates is minimal if and only if z G dV. 

(ii) All minimal points in ficontrib^ must lie on the same torus and the height hf(z) of the 
point(s) in ficontrib^ are at most the minimum, c, of the height function h^ on dT>. 

(iii) If z G flcontrib^D dV, then the hyperplane normal to r at log|z| is a support hyperplane to 
\ogV. 


To summarize, if ficontrib^ contains any minimal points, then these points minimize hr on D and they 
all project, via coordinatewise log-modulus, to a single point of dlogV where the support hyperplane 
is normal to r. 


Proof. A power series converges absolutely on any open polydisk about the origin in which it is 
analytic; this can be seen by using Cauchy’s formula (13.21) to estimate a^. Thus any minimal point 
is in the closure of T>. A series cannot converge at a pole, so z ^ T> and the first assertion follows. 

To prove the second assertion, by strict convexity there is a unique z G dT> that minimizes h. Let 
H be the homotopy {tT(z) : e < t < 1 — e}. This may be extended to a homotopy H' that pushes 
the height below c — e except in small neighborhoods of V lb T(z). Thus the small torus T in (|3.2I1 
is homotopic to a cycle in the union of and a neighborhood of V lb T(z). Sending e ^ 0, it 

follows that Bcontrib has height at most c. If there is a line segment in dlogV, it is possible that the 
minimum height on V occurs on more than one torus, but in this case, if z and w are on different 
such tori, the above argument shows that neither z nor w can be in Bcontribr. 

The third assertion is immediate from the fact that log |z| minimizes the linear function hf on 
the convex set \ogT>. □ 

Having characterized Bcontribf when it is in dV, via a variational principle, we now relate this to 
the algebraic definition of Bcritjr which is better for symbolic computation. It is easy to write down 
the inverse of Bcritr, which we will denote by L. 
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Definition 3.9 (geometry of minimal points). Say that z is a smooth point ofV ifV is a manifold 
in a neighborhood of z. Say that z is a multiple point of V if V is the union of finitely many 
manifolds near z intersecting transversely (that is, normals to any k of these manifolds span a space 
of dimension min{fc,d}j. Points that are neither smooth nor multiple are informally called bad 
points. 


Rem ark . 
e.g., \M]). 


The transversality assumption streamlines the arguments but is not really needed (see, 


Definition 3.10 (the linear space L). Let z € V be in a stratum S. Let L(z) C denote the 

span of the projections of vectors {ziVi ,..., ZdVd) as v ranges over vectors orthogonal to the tangent 
space of S at z. 


The Figure [3] gives a pictorial example of the foregoing definitions (some parts of the figure refer 
to Definition 13.131 belowl. The singular variety V for the function F{x,y) = l/[(3 — x — 2y)(3 — 
2x — y)] is composed of two complex lines meeting at (1,1). The crossing point zq := (1,1) is a 
singleton stratum. The following proposition, showing why L is important, follows directly from the 
definitions. 



Away from z^, K(z) is a real 
ray (shown) and L(z) is the 
complex line containing K(z). 

L(Zo ) = allofC^ 

K ( Zq ) = shaded cone in F? 


Figure 3. direction L(z) for positive real points of V when l/F = {3 — x — 2y){3 — 

2x-y) 

Proposition 3.11 (L inverts ficrit). The point z is in flcrit^ if and only ifr G L(z). Ifz is a smooth 
point, then (replacing FI by its radical if needed), L(z) is the singleton set (in projective space) 

L(z) = {Viogi/(z)} := ||^zi|^(z),...,zd|£(z)^| 

while the set ficrit^ is the (usually zero-dimensional) variety given by the d equations 

H = Q 


dH 


dH 




(3.5) 


{l<j<d-l) 







20 


If z is a multiple point, then L(z) is the span of the vectors 

^ ^ , ( dHk, , dHk, 

:= (z)j 

where H = UkHk- □ 

Connecting this up to the variational principle, we have: 

Proposition 3.12 (smooth minimal points are critical). Ifz is minimal and smooth then L(z) G O 
and is normal to a support hyperplane to logD at log|z|. Consequently, a minimal smooth point z 
is a critical point for some outward normal direction to logi? at log |z|. 

Proof. Assume first that none of the logarithmic partial derivatives hj := ZjdH/dzj vanishes at z. 
Suppose the arguments of two of the partials hk and hi are not equal. Since we have assumed no hj 
vanishes, there is a tangent vector v to V with vj = 0 for j {k, 1} and Vk 0 vi. We may choose 
a multiple of this so that Vkhk and vi,hi both have negative real parts. Perturbing slightly, we may 
choose a tangent vector v to V at z with all nonzero coordinates so that vfhj has negative real part 
for all j. This implies there is a path from z on V such that the moduli of all coordinates strictly 
decrease, which contradicts minimality. It follows that all arguments of hj are equal, and therefore 
that L(z) is a point of O C CP^^; hence it is normal to a support hyperplane to logi? at log |z|. 

Now, suppose hj = 0 for j in some nonempty set, J. Let x = log |z|. Vary {zj : j ^ T} by 0(e) so 
as to stay on V, varying {zj : j £ J} by O(e^). We see that the complement of logi? contains planes 
arbitrarily close to the | J|-dimensional real plane through x in directions {cj : j S J}, whence the 
closure of the complement contains this plane. But by monotonicity, the orthant of this plane in 
the —Cj directions must be in the closure of logD. Thus this orthant is in d\ogD, whence there is 
a lifting of this orthant to V. Now the argument by contradiction in the previous paragraph shows 
that if the arguments of hk and hi are unequal for some k,l ^ J, then we may move on V so the 
moduli of the Zj for j ^ J strictly decrease, while moving down the lifting of the orthant allows us 
also to decrease the moduli of the Zj for j G J, and we have again contradicted minimality. □ 

Having related critical points to L(z) C CP'^“^, we now relate contributing critical points to a set 
K(z) C K.P'^”^. An open problem is to find the right definition of K(z) when z is not minimal. 

Definition 3.13 (the cone K of a minimal point). If z is a smooth point of V in dV, let Qdir(z) 
he defined as L(z), which is in O hy Provosition \S.1S[ For minimal z G V, not necessarily smooth, 
define K(z) to be the convex hull of the set of limit points of Qdir(jw) as w —> z through smooth 
points. 

For an example of this, see Figure[3l in which arrows are drawn depicting K(z) for smooth points 
z in the positive real quadrant and the two-dimensional cone K(zo) is shaded. The next proposition 
then follows from Proposition ld.lll and basic properties of convex sets. 
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Proposition 3.14 (description of K). Let z be a minimal point. 

(i) K(z) C L(z)nO. 

(ii) If z is a smooth or multiple point, then K(z) is the cone spanned by the vectors Viog-fffe of 
Proposition \S.11\ 

(iii) If a neighborhood of z in V covers a neighborhood o/log|z| in dlogD, then K(z) is the set 
of outward normals to \ogV at log |z|. 


The first nontrivial result we need is: 


Theorem 3.15 (K inverts ficontrib). Let z be a minimal point and either smooth or multiple. Then 
V G K(z) if and only if z G ficontrib^, provided, in the multiple point case, that G{z) ^ 0. 


Remark. Here, rather th an globa l mer omoryh icity, it is only required that F be meromorphic in a 
neighborhood o/D(z ); see \P W0^1 and \pWoI ] for details. 


Proof. Assume first that z is strictly minimal, that is, T(z) n V = {z}. 


S uppose for now that r is in the relative interior of K(z). Theorems PW02I . Theorem 3.5] 


and PW04 Theorems 3.6 and 3.9] give expressions for Or which are of the order |r|^z^'’, relying, in 
the multiple point case, on the assumption of transverse intersection, which we have built into our 
definition of multiple point, and on G'(z) ^ 0. 


On one hand, it is evident from (13.21) that 

Or = 0(exp[|r|(/if(w) +e)]) 

for any e > 0, where w G ficontribf. On the other hand, we know by Proposition 13.81 that no points 
higher than z are in ficontribf. Since ficontribf is nonempty, we conclude that it has points at height 
hf (z), whence from the expressions for Ur again, we see that z G ficontribr. 


In the case r G (?K(z) consider f' ^ r through the interior of K(z) (here we mean relative 
boundary and relative interior). There will be a contributing point zl which converges to z. The 
coefficient of the quasi-local cycle at z in the limit must be nonzero (see footnote Sj) . The theorem 
is now proven for strictly minimal points. 


To remove the assumption of strict minim ality, one must verify that this was not necessary for 
the formulae we quoted from PW02I : IPW04| . These formulae were proved by reducing the Cauchy 
inte gral to an integral over a neighborhood A/” of a (d — l)-dimensional subset 0 of V. It is pointed 


out PW02I . Corollary 3.7] that it is sufhcient to assume finite minimality, that is, finiteness of 
T(z) n V. In fact, one needs only finiteness of T(z) n 0, since the truncation of the integral to M 
incurs a boundary term that is sufficiently small as long as 0 avoids T(z). The remaining case, 
where 0 H T(z) is infinite, can be handled by contour rotation arguments, but since that work is 
not yet published, we point out here that in the case d = 2 (the only case used in this survey), the 
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set 0 is one -dimen s ional. T hus when 0 n T(z) is infinite, the set 0 must be a subset of T(z); the 
integrals in PW02i IPW04l | may then be taken over all of 0 with no truncation. □ 


We have remarked earlier that the main challenge in computing asymptotics is to identify ficontribr. 
Our progress to this point is that we may find all strictly minimal smooth or multiple points 
in ficontribr by solving the equation f G K(z) for z. This equation may be solved automati¬ 
cally when P" is a rational function and is often tractable in other cases, for example in the case 
F = {e^ — F>)j(xey — ye^) of Section [T7l 


To complement this, we would like to know when solving r G K(z) and checking for minimality 
does indeed find all points of ficontribr. It cannot, for instance, do so if there are no minimal 
contributing points. Thus we ask (a) are there any minimal points z with K(z) = r, (b) are all the 
contributing points minimal, and (c) is there more than one minimal point? It turns out that the 
answers are, roughly: (a) yes, in the combinatorial case, (b) yes as long as (a) is true, by part (ii) of 
ProDOsition l3.8l and (c) rarely (we know they are never on different tori). 


Let SCO denote the set of all normals to support hyperplanes of \ogD. If the restriction of 
F to each coordinate hyperplane is not entire, then H is the entire nonnegative orthant. This is 
because \ogT> has support hyperplanes parallel to each coordinate hyperplane. When H is not the 
whole orthant, then in directions r ^ O, the quanti ty a-r is either identically zero or decays faster 
than exponentially (this follows, for example, from PW04 (1.2)], since —f • logjzj is not bounded 
from below on V). 


Theorem 3.16 (existence of minimal points in the combinatorial case). Suppose that we are in the 
combinatorial case, > 0. Under the standing assumvtion \3.6\ for every f in the interior of a, 
there is a minimal point z G V which lies in the positive orthant ofMf" and has f G K(z). 


Remark. At this point, it is clear how far one can generalize beyond rational functions. Given a 
compact set K G a in which one wishes to find asymptotics. Let \ogD(K) denote the set of all x 
such that X < y coordinatewise for some y G dlogD whose normal direction is in K. Let 'L>{K) be 
the inverse image under the coordinatewise log-modulus map of\og'D{K). Then, in the combinatorial 
case, for the existence of a minimal point z in the positive orthant with r G K(z), it is sufficient 
that F be meromorphic in a neighborhood of'D{K). 


Proof. We follow the proof of PW02L Theorem 6.3]. For any r in the interior of H there is a point 
X G dlogV with a support hyperplane normal to r. Let Zi = e“* so z is a real point in dV. We 
claim that z G V. To see this, note that there is a singularity on T(z), which must be a pole by the 
assumption of meromorphicity on a neighborhood of D(z). Together, Or > 0 and lack of absolute 
convergence of the series on T(z) imply that F(w) converges to -l-oo as w ^ z from beneath. By 
meromorphicity, z is therefore a pole of F", so z G V. 
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We conclude that there is a lifting of 91ogI? to the real points of V, that is, a subset of the real 
points of V maps properly and one to one onto 91ogI?. By the last part of ProDOsition l3. 141 it follows 
that f S K(z), and hence from Theorem 13.151 that z S ficontribf. □ 

Remark. By convexity oflogD, the point z G O nflcrit{r) in Theorem \3.16\ is unique unless there 
is a line segment in d\ogT>. In this case there is a continuum of such z. Since in almost every 
example the critical point variety is finite, there will be precisely one critical point in the positive 
orthant. Thus in practice we will have no difficulty in determining the point z. 


We say that a power series P is aperiodic if the sublattice of of integer combinations of 
exponent vectors of the monomials of P is all of ifi. By a change of variables, we lose no generality 
from the point of view of generating functions if we assume in the following proposition that P is 
aperiodic. 

Proposition 3.17 (often, every minimal point is strictly minimal). If H — 1 — P where P is 
aperiodic with nonnegative coefficients, then every minimal point is strictly minimal and lies in the 
positive orthant. 


Proof. Suppose that z = xe*®, with 0^0, 


is minimal. Since z S V, we have 


1 


arX*" exp(ir • 0) 

r#0 


— . 

r#0 


Thus z can be strictly minimal only if 0 = 0. 


Equality holds in the above inequality if and only if either there is only one term in the latter 
sum, or, in case there is more than one such term, if exp(jr • 0) = 1 whenever Or 0. The former 
case occurs precisely when P has the form ex'’ and the latter can occur on when P has the form 
P(z) = g{z^) for some b 7 ^ 0, both of which are ruled out by aperiodicity. □ 


We restate the gist of Theorem 13.161 and ProDOsition l3.17l as the following corollary. 

Corollary 3.18. If H = 1 — P with P aperiodic and having nonnegative coefficients, then the 
contributing critical points as r varies are precisely the points of V O that are minimal in the 
coordinatewise partial order. The point z is in ficontribf exactly when r S K(z). 


Without the assumption H = 1 — P, assuming only > 0, the same holds except that there might 
be more contributing critical points on the torus T(z). □ 
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3.3. Asymptotic formulae for minimal smooth points. It is shown in PW02l | how to compute 
the integral (j3.2|) when f is fixed and ficontribr is a finite set of smooth points on a torus. Given a 
smooth point z G V, let /z be the map on a neighborhood of the origin in taking the origin to 
zero and taking (0 i,..., 9d-i) to logic for w such that 

e V. 

The following somewhat general result is shown in PW02I . Theorem 3.5]. 


Theorem 3.19 (smooth point asymptotics). Let K <Z a be compact, and suppose that for r G K, 
the set ficontribp is a single smooth point z(r) and that /z has nonsingular Hessian (matrix of second 
partial derivatives). Then there are effectively computable functions bi{r) such that 

(3.6) Ur ~ z(r)"-- ^ 5z(f)(rd)-('+''-i)/2 

Z>0 

as an asymptotic expansion when jrj ^ oo, uniformly for r G A. □ 

Remark. In connection with Theorem, \S.19[ the following points should be noted. 


(i) The coefficients bi depend on the derivatives of G and H to order k + I — \ at z,{r), and 
bo{r) = 0 if and only i/G(z(r)) = 0. 

(ii) The sum on the right of eaua,tion \S.6\ mau he rewritten as a sum o/6*(r)|r|“(^+‘^“^)/^ where 

~ in order to see what depends on f and what depends on jrj. 

(iii) Suppose ficontrih^ is a finite set of points ... z^^^, satisfying the hypotheses of the theo¬ 
rem. Then one may sum over these, obtaining 

Ur ^ ^zW(f)--^6:,(f)|rr('+‘^-i)/^ 

k l>0 


Theorem 13.191 is somewhat messy: even when ^ = 1, the combination of partial derivatives of 
G and H is cumbersome, though the prescription for each bi in terms of partial derivatives of G 
and H is completely algorithmic for any I and d. In the applications herein, we will confine our 
asymptotic computations to the leading t erm. Th e following expression for the leading coefficient is 
more explicit than (13.6p and is proved in PW02L Theorems 3.5 and 3.1]. 


Theorem 3.20 (smooth point leading term). When G(z(r)) 0, the leading coefficient is 


(3.7) 


bo = 


- 1 / 2 . 


G(z(r)) 


-ZddH/dzd 

where TL denotes the determinant of the Hessian of the function /z at the origin. 


Corollary 3.21. In particular, when d = 2 and G(z(r)) ^ 0, we have 


Qj^s 


/ -yHy 
y sQ{x,y) 


(3.8) 
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where {x^y) — z((r, 5)) and Q{x^y) is defined to be the expression 

(3.9) - {xH,){yHyf - {yHy){xH^f - [{yHyfx^H,, + {xH^fy^Hyy - 2{xH,){yHy)xyH^y] . 

□ 

Remark. In the combinatorial case, the expression in the radical will be positive real (this is true 
for more general F with correct choice of radical). The identity ryHy = sxHx (see Provosition IS.llV 
shows that the given expression for Ors, though at first sight asymmetric in x and y, has the expected 
symmetry. 


3.4. Multiple point asymptotics. Throughout this section, z will denote a minimal multiple 
point. We will let m denote the number of sheets of V intersecting at z and let 

m 

( 3 . 10 ) = 

denote a local representation ol H as a product of nonnegative integer powers of functions whose 
zero set is locally smooth. For 1 < fc < m, let denote the vector whose component is 
ZjdHk/dzj. 


We divide the asymptotic analysis of Or into two cases, namely m > d and m < d. In the 
former case, since we have assumed transverse intersection at multiple points, the stratu m of z is 
just the singleton {z}. The asymptotics of Or are simpler in this case. In fact it is shown in 
Theorem 3.1] that 


PemOC , 


(3.11) 


Or = Z 


■(P(r) + £;(r)) 


where P is piecewise polynomial and E decays exponentially on compact subcones of the interior of 
K(z). We begin with this case. 


We will state three results in decreasing order of generality. The first is completely general, the 
second holds in the special case m = d and = 1 for all k < m, and the last holds for m = d = 2 and 
ni = n 2 = 1. Our version of the mos t gener al result provides a relatively simple formula from BP | 
but under the more general scope of PW04 1. 

Definition 3.22. Let M = Z®’' 1 < j < M let tj be integers so that the multi¬ 
set {ti,... ,tM} contains Uk occurrences of k for 1 < k < m. Define a map —> by 

'l'(ej) = b^b). Let be Lebesgue measure on and let P(x) be the density at x of the push- 
forward measure X^ o dt with respect to Lebesgue measure under tf. The function P is a piecewise 
polynomial of degree M — d, the regions of polynomiality being no finer than the common refinement 
of triangulations of the set {bO) : 1 < fc < m} ir 


BL] . definition 5]. 


The above definition of P is a little involved but is made clear by the worked example in Sec¬ 
tion [?7TTJ Armed with this definition, we may say what happens when there are d or more sheets of 
V intersecting at a single point. 
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Theorem 3.23 (isolated point asymptotics). If m > d and 2 , is a minimal point in a singleton 
stratum, with G{z) ^ 0, then uniformly over compact subcones o/K(z), 

(3.12) ar~G(z)pf^,...,^')z-^ 

V -Zl ZdJ 

provided that ficontrib^ contains only z. This formula may be summed over ficontrihp as long as 
ficontribfr has finite cardinality. 

In the case z = 1, iS.12]) reduces further to 

ar^G(l)P(r). 

If furthermore, ar/G(l) are integers, then ar/G(l) is actually a piecewise polynomial whose leading 
term coincides with that of P{r). 


Proof. First assume z is strictly minimal. Theorem 3.6 of PW04| gives an asymptotic expression for 
Or valid whenever z is a strictly minimal multiple point in a singlet on s tratum. The formula, while 
not impossible to use, is not as useful as the later formula given in BPl . equation (3.8)]. This latter 
equation was d erived u nder the assumption that iJ is a product of linear polynomials. Noting that 
the formula in PWOdL Th eore m 3.6] depends on H only through the vectors : 1 < fc < m}. 


we see it must agree with BPl equation (3.8)] which is ()3.12|) . The last statement of the theorem 
follows from (|3.1ip . 


To remove the assumption of strict minimality, suppose that z is minimal but not strictly minimal 
and that there are finitely many points ... ,z(^^ in ficontribr, necessarily all on a torus. The 
torus T in may be pushed out to (1 + e)T(z) except in neighbo rhoods of each The same 
sequence of surgeries and residue computation in the pro ofs of PW04 Proposition 4.1, Corollary 4.3 
and Theorem 4.6] now establish Theorem 3.6 of PW04I] without the hypothesis of strict minimality, 
and as before one obtains (13.121) from (3.8) of BP|. □ 


While this gives quite a compact representation of the leading term, it may not be straightforward 
to compute P from its definition as a density. When M = m = d, P is a. constant and the 
computation may be reduced to the following formula. 

Corollary 3.24 (simple isolated point asymptotics). If m = d and each Uk = I in hS.KM . then if 
G(z) 0, and 2 is a minimal point in a singleton stratum, then 

Or ~ G(z)z”'’ JJ]”^ 

where J is the Jacobian matrix {dHi/dzj). This formula may be summed over finitely many points 
as in the remark following Theorem \3.19l □ 


Occasionally it is useful to have a result that does not depend on finding an explicit factorization 
of H. The matrix J can be recovered from the partial d erivatives of H. The result of this in the 
case m = d = 2 is given by the following formula PW04 Theorem 3.1]. 
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Corollary 3.25 (simple isolated point asymptotics, dimension 2). If M = m = d = 2, and if 
G(z) 7 ^ 0, then setting z = ix,y), 

—r —s y) 

Qrs ^ X y 


yj-x’^y'^n 

where H := HxxHyy — Hxy '^he determinant of the Hessian of H at the point (x, y). In the special 
but frequent case x = y = 1 we have simply 

G(l,l) 

^ • 

This formula may be summed over finitely many points as in the second remark following Theo- 
rem \d.lfA □ 


Finally, we turn to the case m < d. In this case, z is in a stratum containing more than just one 
point. The leading term of is obtained by doing a saddle point integral of a formula such as (13.121) 
over a patch of the same dimension as the stratum. Stating the outcome takes two pages in 


PW04|. 


Rather than give a fo rmula for the resulting constant here, we state the asymptotic form and refer 
the reader to 


PW04I . Theorem 3.9] for evaluation of the constant, bg. 


Theorem 3.26. Suppose that asr varies over a compact subset K of O, the set ficontribY is always a 
singleton varying over a fixed stratum of codimension m < d. If also z(r) remains a strictly minimal 
multiple point, with each smooth sheet being a simple pole of F, and if G(z) 7 ^ 0 on K, then 


(3.13) 

uniformly os jrj 


Or ~ z(r) '■ 6 o(r)|r| 


00 in K. 


□ 


3.5. Distributional limits. Of the small body of existing work on multivariate asym ptotics, a good 
portion focuses on limit theorem s. Consider, f or example, the point of view taken in Ben73t lBR,83 | 
and the sequels to those papers 


OR92 


BR99|, where one thinks of the the numbers Or as defining 
a sequence of (d — l)-dimensional arrays, the of which is the horizontal slice {ur : = fc} 

(cf. Section [L^ . Often this point of view is justified by the combinatorial application, in which 
the last coordinate, Vd, is a size parameter, and one wishes to understand rescaled limits of the 
horizontal slices. Distributional limit theory assumes nonnegative weights Or, so for the remainder 
of this section we assume we are in the combinatorial case, Or > 0. 

If Gfc := X]r'r-d=fc < 00 for all k, we may define the slice distribution pk on (d — l)-vectors to 
be the probability measure giving mass arjCk to the vector (ri,... ,rd-i) in the k*^ slice. There 
are several levels of limit theorem that may be of interest. A weak law of large numbers (WLLN) is 
said to hold if the measures A >—>■ pk{jA) converge to a point mass at some vector m (here, division 
of A by fc means division of each element by k). Equivalently, a WLLN holds if and only if for some 
mean vector m, for all e > 0, 


(3.14) 


lim pk 


{r: 


> e) = 0. 
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Stronger than this is Gaussian limit behavior. As r varies over a neighborhood of size Vk about 
km with rd held equal to A:, formula (13.61) takes on the following form: 

ar~G|r|(-(‘^-i)/2exp[</)(r)] . 

Here the factor varies only by o(l) when r varies by 0{Vk), and 

(l){r) := -r • log|z(r)|, 

or just —r • logz(r) when z(r) is real. The WLLN identifies the location rmax at which (p takes its 
maximum for rd held constant at k. A Taylor expansion then gives 

(3.15) exp [(/)(ri„ax + s) - </>(rmax)] = exp 

for some nonnegative quadratic form, B. Typically B will be positive definite, resulting in a local 
central limit theorem for {^ik}■ 


Unfortunately, the computation of B, though straightforward from the first two Taylor terms of 
H, is very messy. Furthermore, it can happen that B is degen erate, a nd there does not seem to be a 
good test for this. As a result, existing limit theorems such as BR831 Theorems 1 and 2] and 


GR92, 


Theorem 2] all contain the hypothesis “if B is nonsingular”. Because of the great attention that 
has been paid to Gaussian limit results, we will state a local central limit result at the end of this 
section. But since we cannot provide a better method for determining nonsingularity of B, we will 
not develop this in the examples. 


Distributional limit theory requires the normalizing constants Ck be finite. We make the slightly 
stronger assumption that Ur = 0 when r/r^ is outside of a compact set K. We define the map 


r I— » r 


\rd rd ) 


which will be more useful for studying horizontal slices than was the the previously defined projection 

r 1 -^ r. 


Theorem 3.27 (WLLN). Let F be a d-variate generating function with nonnegative coefficients, 
and with Ug = 0 for s/sd outside some compact set K. Let f(x) = F)!,..., 1, x). Suppose f 
has a unique singularity xq of minimal modulus which is a simple pole. Let r = 6(i*r(x), where 
X = (1,..., 1, xo). If F is meromorphic in a neighborhood o/D(x), then the measures {/ifc} satisfy 
a WLLN with mean vector r* := {rifrd, ■ ■ ■ ,rd-i/rd). 


Proof. We begin by showing that logi? has a unique normal at log |x|, which is in the direction f. 
We know that xq is positive real, since / has nonnegative coefficients. The proof of Theorem 13.161 
showed that a neighborhood A/” of x in V maps, via log-moduli, onto a neighborhood of log |x| in 
i91ogI?. Since xq is a simple pole of /, we see that JV is smooth with a unique normal ViL(x), whence 
logD has the unique normal direction f at log |x|. 
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We observe next that J2k whence, via Theorem l2.21 Ck ^ cxq ^. Thus Theorem l3.27l 

will follow once we establish: 


(3.16) 


lim sup — log 
k k 


E 


ys:sd, — k,s* — r* 



< -logxo. 


Each s* G K with |s* — r*| > e is not normal to a support hyperplane at log |x|, so for each such 
s* there is a y € dlogD for which 

(3.17) hg. (y) < hg. (x) = - log xo . 

By compactness of {s* € if : |s* — r*| > e} we may find hnitely many y snch that one of these, 
denoted y(s*), satisfies (13.171) for any s*. By compactness again, 

sup/is*(y(s*)) < -logxo . 


It follows from the representation (13.21) . choosing a torus just inside T(y) for y as chosen above 
depending on s, that |s|“^ logOg is at most — logxo — <5 for some 5 > 0 and all sufficiently large s 
with |s* — r*| > e. Summing over the polynomially many such s with Sd = k proves (13.161) and the 
theorem. □ 


Coordinate slices are not the only natural sections on which to study limit theory. One might, 
for example study limits of the sections {or : |r| = k}, that is, over a foliation of {d — l)-simplices 
of increasing size. The following version of the WLLN is adapted to this situation. The proof is 
exactly the same when slicing by |r| or any other linear function as it was for slicing by r^. 

Theorem 3.28 (WLLN for simplices). Let F be a d-variate generating function with nonnegative 
coefficients. Let f(x) = F[x,...,x). Suppose f has a unique singularity xq of minimal modulus 
which is a simple pole. Let r = Qdir(x), where x = (xq, ..., xq). If F is meromorphic in a neighbor¬ 
hood o/D(x), then the measures defined analogously to fj,k but over the simplices {|r| = k}, 

satisfy a WLLN with mean vector r. □ 

We end with a statement of a local central limit theorem. 

Theorem 3.29 (LCLT). Let E,/, xo,x, r and r* be as in Theorem \3.2T\ Suppose further that 
fScontrib^ is the singleton, {x}. If the guadratic form B from 1,9.1.5)) is nonsingular, then 

(3.18) lim sup l^fe(r) — n_B(i’)| = 0 

k —>-oo r 

where 

(3.19) nB(r) := (27r)“^‘^“^^/^ det(i?)“^/^ exp -i3(r)^ . 


is the discrete normal density. 


□ 
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4. Detailed examples 


In Sections|4]“[7]we work a multitude of examples. We aim to cover a sufficient variety of examples 
so that a user with a new application is likely to find a worked example that is similar to his or 
her own. With luck, the derivation of the result will be adaptable, and the user can thereby avoid 
original sources as well as the bulk of Section [3] of the present survey. The use of computer algebra 
is essential to many of these examples, and we preface the worked examples with a brief explanation 
of this. 


To apply any of the theorems of Sections |3. 31 - 13.51 one must solve for the point z G V as a function 
of the direction r and then plug this into a variety of formulae. This may always be done numerically, 
but there are advantages to doing it analytically when possible: one may then differentiate, compute 
asymptotics near a point of interest, solve in terms of other data, and so forth. Even when one is 
interested chiefly in a numerical approximation for a single Or, one often obtains better numerics by 
simplifying analytically as much as possible. 


When the generating function is rational or algebraic, as is quite frequent in applications, if one 
attempts to solve for z and plug in the results, one often finds that a computer algebra system 
cannot simplify an expression that one suspects should be simpler. Consequently, the computer 
cannot tell whether such an expression is zero, may have trouble plotting the expression, and so 
forth. As is well known in computational commutative algebra, these problems can be avoided by 
working directly with polynomial ideals. This survey incorporates the Groebner package in the 
computer algebra system Maple to illustrate the required computational algebra. We hope that 
these commands from Maple version 10 will remain in the Maple platform, but in any case, we 
include platform-independent explanations of the algebra. Our treatment i s necessarily qu ite brief, 
and readers who require more explanation should consult a source such as 


CLO05; 


Stu05 |. 


Suppose a point z G is the solution to polynomial equations Pi(z) = • • • = Pfc(z) = 0. The 
set of all polynomials P for which P{z) = 0 is an ideal, and, if the number of common solutions is 
finite, is said to be a zero-dimensional ideal. Such an ideal has many generating sets or bases, 
of which some are particularly useful to know. In particular, given a term order, that is, an order 
on the monomials z'’ that obeys certain properties, each ideal has a unique Grdbner basis. The 
lexicographic term order, known to Maple as plex, has the property that it forces the Grobner basis 
of a zero-dimensional ideal to contain a univariate polynomial in the variable designated to come 
last in the lexicographical order. Thus, if pi,P 2 and ps are polynomials in x,y and z with finitely 
many common solutions, then the commands (the first just loads the Grobner basis package into 
Maple) 


> with(Groebner); 

> L := Basis! [pi,p2,p3],plex(x,z,y)); 
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will produce a basis for the ideal of all polynomials vanishing on the common solutions to pi,P 2 
and p 3 . The last variable in the variable list is y, so the choice of term order plex causes the last 
polynomial of the basis L to be a polynomial in y alone, whose roots are the possible y-coordinates 
of the common solution points. 

A few words may prove helpful concerning the classification of critical points into smooth, multiple 
and bad points. A stratum always consists of points of a single topological type. The critical point 
equations are different for different strata. Therefore, sorting critical points by type occurs as a 
preliminary step of sorting strata by type. If one assumes the denominator H of F is square free (if 
not, a single Maple command allows one to pass to the radical), then the smooth points of V are 
precisely those where Vi? does not vanish. The first step is always to check whether there are any 
non-smooth points; generically, there are not, though interesting applications tend not to be generic. 
This step is accomplished by the single command 


> Basis ([H , diff(H,x) , diff(H,y) , diff(H,z)] , tdeg(x,y,z)); 


Maple returns the trivial ideal [1] if and only if there are no non-smooth points on V. We have 
used tdeg rather than the slower plex to check whether we get [1] ; if not, then we can go back and 
use plex to produce a more useful basis for the ideal corresponding to the variety of singular points. 
Further tests may be done to determine whether a stratum of singular points consists of multiple 
points: one must check whether the ra dical of t he ideal is reduced; for definitions and algorithms, 
the reader is referred to a text such as 


Mum95|. 


The algebra above is often sufficient to compute quantities such as the quantity ficritr, which is 
defined by the polynomial equations (13.511 . One may, however, go further: the substitution of one of 
these finitely many points into a polynomial expression such as (13.91) results in an algebraic number 
which Maple usually cannot write in its simplest form. Later, we discuss two reasonable ways to 
handle this. Given a quantity x defined by algebraic equations pi,... ,pfe, the most straightforward 
way to simplify the polynomial expression Q{x) is to reduce Q modulo the ideal generated by 
{pi,... ,pfc} (see Section 02])• An alternative is to obtain directly the minimal polynomial satisfied 
by Q{x). This may be done by elimination (see Section 14.41) or by matrix representation (see 
Section lT5l) . These techniques are useful for almost every example, but to avoid being repetitive, 
we often refer back to these fully worked computations. 


4.1. Binomial coefficients. To allow the reader a chance to check the application of Corollarv l3.21l 

in a familiar setting, we begin with an example where the numbers a^s are explicitly known. Let 
(r -I- s)! 


^rs — 


r s 
r, s 


r! s! 


be the binomial coefficients and let 


Fix,y) = X! ^rsx'^y” . 

r,s>0 
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The binomial coefficients satisfy the recurrence ar,s = Or-i.s + ar.,s-i and this holds for all (r, s) ^ 
(0,0), provided that we take Ors to be zero when either r or s is negative. Linear recursions for 
ars in terms of values with {r',s') < (r, s) in the coordinatewise partial order lead easily to 

rational generating functions (as discussed in Section [71 more general linear recursions do not yield 
rational functions). To find F, we observe from the recursion that all the coefficients of {1 — x — y)F 
vanish except the (0,0)-coefficient, which is 1. Thus 


F{x,y) 


1 

1-x-y' 


Let us compute the set Bcontribr and the quantities appearing in Corollary 13.211 The singular 
variety V is the complex line x + y = 1. The numerator of T’ is 1, and in particular, it never vanishes. 
For any z S V, the space L(z) is the linear span of z. To see this, either use Proposition 13 .11 1 or note 
that the tangent space to V is everywhere orthogonal to (1,1) and plug this into Definition 13.101 


For each direction r in the positive real orthant, there is thus a unique solution z G V to r € L(z), 
namely z = f = (y^,y^) for any representative (r, s) of r. One may apply Theorem 13.161 to 
conclude that Bcontribr = {?}. 


To apply Corollary 13.211 we need only compute the quantity Q in dSlD and verify that it is 
nonzero. We have = 1 — x — y, Fl^ = — 1, Fly = — 1 and all other partial derivatives of H are zero, 
whence Q = -xy{x + y) and plugging in (x, y) = (;:^, ;:^) gives 

-yFly _ 1 _ r -I- s 

sQ{x,y) sx{x + y) rs 

Substituting this and G = 1 into (13.81) gives 


(4.1) 


r±±Y f j r + s 

r J \ s y V 27 rrs 


This asymptotic expression is valid as (r, s) oo, uniformly if r/s and s/r remain bounded - see 
the uniformity conclusion in The oreml3.19l In fact, we know from Stirling’s formula that this holds 


uniformly as min{r, s} —> oo; see Lla| to obtain the latter result in the present framework. 


Although we have not presented precise error bounds in our asymptotic approximations, the errors 
for such smooth bivariate problems can be shown to be of the order 1/s and our approximations 
are good even for moderate values of r and s. As a randomly chosen example, we observe that the 
approximation above yields a relative error of about 0.8% when r = 25, k = 12, while the relative 
error has decreased to about 0.4% when r = 50, A: = 24 and 0.08% when r = 250, s = 120. 


4.2. Delannoy numbers. Recall the Delannoy numbers a^s that count paths from the origin to 
the point (r, s) with each step having displacement (1,0), (0,1) or (1,1). In exactly the same way 
























33 


that we obtained the generating function for the binomial coefficients, we may use the recursion 
ar,s = Or-i.s + ar,s-i + ar-i,s-i Valid for all (r, s) except (0, 0) to obtain the generating function 


F{x,y) 


arsX^y’^ 

r,s>0 


1 

1 — X — y — xy 


so the denominator is given hy H = 1 — x — y — xy. 


Solving for Bcontribf is only a little more involved for this generating function than it was for 
1/(1 — X — y). However, performing all calculations by hand, as we did in the case of binomial 
coefficients, is tedious for this example and completely impractical for later examples. The step- 
by-step computation below illustrates how a computer algebra system such as Maple can carry the 
derivation to completion at a symbolic level. 

The linear space h{x,y) is the one-dimensional complex vector space spanned by (xHx,yHy) = 
(—a:(l -I- y),—y{l -\- x)). The most convenient way in which to solve the equations H = Q and 
r G L(x,y) is to obtain a Grobner basis. We load Maple’s Groebner package using the command 
with(Groebner) and then execute the following command. 


> GB:=Basis( [H, s*x*diff(H, x) - r*y*diff(H, y)], plex(x,y)); 

This computes a basis for the ideal corresponding to the common solutions to ff = 0 and (r, s) G 
L(x,y). The answer is GB := [—s -|- sy'^ -\- 2ry,s — sy — r + rx] =: [pi{y),p 2 (x,y)]. The first 
element of the basis is an elimination polynomial in y alone. Solving = 0 yields the two values 
y = (—r ± y'r'^ + s^)/s. For each of the two y-values there is a unique a;-value obtained by solving 
P 2 for X. One might observe that symmetry tells us these will be (—s ± -|- s^)/r; however we 

need p 2 to tell us which x-value goes with which y-value. We find that there is a unique positive 
solution 

- . Vr^+^ — s 

x(v) = -; 

r 

- r 

2/(i-) = -• 

s 

By Theorem 13.161 we see that ficontribr = {x{r),y{r)) for any positive direction r. The numerator, 
G, is again identically 1. It remains only to plug the values for x and y into expressions for Q and 
ars and simplify. 

Using the definition of Q and simplifying, we obtain Q = xy{l + a;)(l -I- y){x + y). Let us reduce 
Q modulo H by the command 


NormalForm (Q, [H, s*x*diff(H ,x) - r*y*diff(H, y)], plex(x,y)); 


to get 
(4.2) 


Q = —2x^ — 2y^ -I- 6a; -I- 6y — 4 . 
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Finally, substituting x = a;(r) and y = y{r) into —yHy/{sQ) yields 

-yHy _ _rs_ 

sQ + s^(r + s — 


and putting this all together yields the expression (El for the Delannoy numbers: 


— s 


+ s2 — r 


rs 


2'k Y Vr2 + s'^ir + s - i/r^ + 


For example, putting r = s = n shows that the central Delannoy numbers a„„ have first order 
asymptotic approximation 


4.3. Powers, quasi-powers, and generalized Riordan arrays. It often happens that we wish 
to estimate [z'^]v{z)^, that is, the coefficient of a large power of a given function v{z). Clearly, 
this is equal to the x^y^ coefficient of the generating function 


(4.3) 


F{x,y) := 


1 


1 - yv{x) 

One place where this arises is in the enumeration of a combinatorial class whose objects are strings 
built from given blocks. Let v{z) := count the number a„ of blocks of size n. Then the 

generating function (14.31) counts objects of a given size by the number of blocks in the object. 


Example 4.1. A long sequence of zeros and ones may be divided into blocks by repeatedly stripping 
off the unique initial string that is a leaf of T, a given prefix tree. Lempel-Ziv coding, for instance, 
does this but with an evolving prefix tree. When v(x) is the generating function for the number a„ 
of leaves of T at depth n, then 1/(1 — yv{x)) generates the numbers a^s of strings of length r made 
of s blocks (the final block must be complete). 


Another place where generating functions of this form arise is in the Lagrange inversion formula. 
This application is discussed at length in Section d but briefly, if h solves the equation h{z) = 
zv{h{z)), then even if we cannot explicitly solve for h, its coefficients are given by 


[z-]h{z) = -[z-M^r- 

This identity has been very p rofitable in the analysis of planar graphs and maps, cf. the discussion 
of results of 


GW99 


BFSS01 | in Section 1^31 


A third place where coefficients of powers arise is in sums of independent random variables. Let 
^( 2 ^) = be the probability generating function for a distribution on the nonnegative 

integers, that is, a„ = P(Aj = n) where {Xj} are a family of independent, identically distributed 
random variables. Then v(z)" is the probability generating function for the partial sum 5'„ := 
Xj, and hence 

p(5„ = k) = [z’^]v{zr ■ 
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A Riordan array is defined to be an array {ank ■ n,k>0} whose generating function F{x,y) := 
Sn,fc>o o.rikX^y^ satisfies 


(4.4) 


F{x,y) 


1 - yv[x) 


for some functions 4> and v with ?;(0) = 0 and 4){Q) ^ 0. If in addition ^'(O) ^ 0 the array is 
called a proper Riordan array. Just as (14.3p represents sums of independent, identically distributed 
random variables when r; is a prob ability g enerating function, the format (14.411 generalizes this to 


delayed renewal sums (see, e.g. Dur04 Section 3.4]), where an initial summand Xq may be 


added that is distributed differently from the others. The quasi-powers (HH) arising in GF-sequence 
analysis, which were described in Section 11.21 are asymptotically of this form as well. Thus (|4.4I) 
approximately encompasses most of the known results leading to Gaussian behavior in multivariate 
generating functions. 


Riordan arrays have been widely studied. In addition to enumerating a great number of combi¬ 
natorial classes, Riordan arrays also behave in an interesting way under matrix multiplication (note 
that the condition r!(0) = 0 implies = 0 for k < n, and, by triangularity of the infinite array, 
that multiplication in the Riordan group is well defined). Su rveys of the Riordan group and its 
combinatorial applications may be found in [SDr94 ISGWWQl |. 


As we will see in this section, the asymptotic analysis of these arrays is relatively simple, or at 
least, is no more difficult than analyses of the functions (j) and v that define the array. One should 
note, however, that Riordan arrays are often defined by data other than (f> and v. Gommonly, one has 
a linear recurrence for a„,fc+i as a sum the generating function A{t) := 

is known and is f airly simple, but the function v{x) in (14.41) is known only implicitly through an 
equation ([RogTSi, Equation 6]) 


v{x) = xA(v{x)) 


which is reminiscent of Lagrange inversion. The paper Wil05| . which is devoted to bivariate asymp¬ 
totics of Riordan arrays, discusses at length how to proceed when the known data includes A{t) 
rather than v{x). Thus there are versions of (14.101) below, such as Proposition 16.21 which state 
asymptotics in terms of A without explicit mention of v. The discussion in this section, however, 
will be limited to deriving asymptotics in terms of (j) and v. For our analyses it is not important 
to require r’(O) = 0 (for example, neither the binomial coefficient nor Delannoy number examples 
above satisfies that condition), so we drop this hypothesis and consider generalized Riordan arrays 
that satisfy (03) but may have r'(O) ^ 0. 
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The following computations show that the quantities L(a;, y) and Q{x, y) turn out to be relatively 
simply expressed in terms of the function v{x). Define the quantities 

xv'(x) 


(4.5) 


y(v]x) := 


}{x) 


(4.6) 


(T^(u; x) 


x'^v"{x) 

v{x) 


+ y{v, x) — y{v; x)^ 


dy,(v; x) 
^ dx 


It is readily established that for {x,y) G V, we have L(x,y) = (/j(v; x), 1). In other words, 
(x,l/v(x)) G Bcrit(r,s) if and only if sy.(v;x) = r. Furthermore, when this holds, Q{x,l/v{x)) = 
x). Provided that (j) and are nonzero at a minimal point, the leading term of its asymptotic 
contribution in (13.811 then becomes 


(4.7) 


Qjfs 


..-r..,,,. !/-(»)) 

y/2Trsa^{v; x) 


where /i(z;; x) = r/s. 


The notations /i and are of course drawn from probability theory. These quantities are always 
nonnegative when v has nonnegative coefficients. To relate this to the limit theorems in Section [331 
observe that setting a; = 1 gives 


(4.8) 


(4.9) 


yiv] I) = 


cr^(u;I) = 


u(I) ’ 


-Wf+'. 


-(!)• 


Thus, under the hypotheses of Theorem 13.271 a WLLN will hold with mean m = /i(r!; 1). Of course 
we see here that y,{v] I) is simply the mean of the renormalized distribution on the nonnegative 
integers with probability generating function v. Similarly, we see in Theorem l3.29l that B{r, s) = {s — 
/r(u; l)r)^/cr^(u; 1) is the Gaussian term corresponding to the variance cr^(w; I) of this renormalized 
distribution. 


Suppose we are in the combinatorial, aperiodic case: the coefficients of v are nonnegative and 
v{x) cannot be written as x^g(x'^) for any power series g and d > 1. The set V is the union of 
the set Vo, parametrized as (x, l/v{x)), with the union of horizontal lines where the value of a: is a 
singular value for v or cj). As x increases from 0 to R, the (possibly infinite) minimum of the radii of 
convergence of v and (p, it is easy to verify that all the points of Vo encountered are strictly minimal, 
and that a'^(v;x) > 0. Since the derivative of g{v\x) is x)/x, this shows that g{v]x) is strictly 
increasing on (0, R). Thus A := fi{v; 0) and B := y{v] R) are well defined as one-sided limits and for 
A < X < B there is a unique solution to y) = A. In fact, A is the order of v at 0, so equals I for 
proper Riordan arrays, but may be 0 for generalized Riordan arrays or an integer greater than 1 for 
improper Riordan arrays. From this it is evident that Ors = 0 for r/s < A, so we should not have 
expected a solution to g(v;x) = X when A < A. If i? = oo, then B is the (possibly infinite) degree of 
V, and one has again that Ors = 0 for r/s > B. If u or <() is not entire, then one cannot say without 
further analysis what one expects for Urs when r/s —> X > B. To summarize: 
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Proposition 4.2. Let (v{x),4>{x)) determine a generalized Riordan array. Suppose that v{x) has 
radius of convergence R > 0 and is aperiodic with nonnegative coefficients, and that </> has radius of 
convergence at least R. Let A,B be as above. Then for A < r/s < B, 

OO 

(4.10) Urs v{xyx~''hk{r/s)s~^ 

fe =0 

where x is the unique positive real solution to iJ,{v,x) = r/s. Here bo = . y 0. The 

yj 2-K(7^ 

asymptotic approximation is uniform as r/s varies within a compact subset of {A, B), whereas ars = 0 
for r/s < A. □ 


We note that if the combinatorial restriction is lifted, much more complicated behavior can occur. 
The generating function 3/(3 — 3a; — j/ + x^) is of Riordan type with (j){x) = v{x) = (3 — 3x + x^)“^, 
and even though v is aperiodic, ficontribr has cardinality 2. Furthermore, at the unique contributing 
point for the diagonal direction, vanishes. 

The condition on the radius of convergence of /> is satisfied in most applications. One way in 
which it may fail is when F" is a product of more than one factor. We will see an example of this 
in Section HU The next two subsections consider applications of Proposition 14.21 to combinatorial 
applications. 


4.4. Maximum number of distinct subsequences. Flaxman, Harrow and Sorkin FHS04l| con¬ 
sider strings over an alphabet of size d which we take to be {1,2, ...,c?} for convenience. They 
are interested in strings of length n which contain as many distinct subsequences (not necessarily 
contiguous) of length k as possible. Let a„fe denote the maximum number of distinct subsequences 
of length k that can be found in a single string of length n. Initial segments S'|„ of the infinite 
string S consisting of repeated blocks of the string 12 - ■ ■ d turn out always to be maximizers, that 
is, S\n has exactly Unk distinct subsequences of length k. The generating function for {a„/c} is given 
by FHS04, equation (7)]: 

F{x,y) = = z -^ • 

n,k 

This is of Riordan type with (/{x) = (1 — x)~^ and v{x) = x + x^ + ■ ■ ■ + x‘^. 


The case d = 1 is uninteresting. Suppose that d > 2. The singular variety V is the union of the 
line X = 1 and the smooth curve y = l/v(x) and they meet transversely at the double point (1,1/d); 
see Figure HI for an illustration of this when d = 3. 


This is a case where the radius of convergence of is less than the radius of convergence of v, 
the former being 1 and the latter being infinite. We have /i(u; x) = 1/(1 — x) — dx‘^/(l — x'^) = 
(1 -I- 2x -I- 3x^ -I- ■ • • -I- dx'^“^)/(l -I- X -I- x^ -I- • • • -I- x'^~y. As x increases from 0 to 1 (the radius 
of convergence of />, which is the value of x at the double point), p, increases from 1 to (d -|- l)/2. 
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X 


Figure 4. V in the case d = 3 

Thus when A := n/k remains in a compact sub-interval of (1,^^), the Gaussian asymptotics of 
equation (|4.10p hold. 

To compute these in terms of A, we solve for x in 

Ti 

(4.11) n{v;x)=X:=- 

and plug this into 611). One can do this numerically, but in the case where u is a polynomial, one 
can do better. 


Solving /r(u;x) = A by radicals and plugging into (14.61) . which worked in Section 621 will not be 
possible when d > 5 and is not practical even for smaller d. However, we see that is algebraic, 
in the same degree d — 1 extension of the rationale that contains the value of x solving (14.1111 . We 
look therefore for a polynomial with coefficients in Q(A), of degree d — 1, which annihilates the 
in Proposition 14.21 To find this polynomial, the best tactic is to work directly with generators of 
polynomial ideals, and we give the details below. 


When d = 2, the solution 


x(A) 


A- 1 
2-A 


is a rational function of A and nothing fancy is needed to arrive at cr^ = (A — 1)(2 — A). We therefore 
illustrate with d = 3, though this procedure is completely general and will work any time u is a 
polynomial. 


Plugging in the expression (14.511 for x) in (j4.1ip and clearing denominators gives a polynomial 
equation for x: 

x±-A„ = 0. 

ax 
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In our example, 

(4.12) a;(l + 2a; + 3 — A (x + x^ + x^) = 0 . 


We now need to evaluate 


(4.13) 


cr^(v; x) 


d/r 

dx 


cc (l + 4a; + x"^) 
(1 + a; + a;^)^ 


at the value x that solves (14.1211 . 


To do this we compute a Grobner basis of the ideal in Q(A)[a;, S'] generated by fi{v,x) — A and 
a'^(v;x) — S (after clearing denominators). The commands 


pi:=(l+2*x+3*x~2)-lambda*(l+x+x"2): 
p2:=x*(l+4*x+x~2)-S*(l+x+x'2 )~2 : 
Basis! [pi, p2], plex(x, S)); 


produce the elimination polynomial 

p{S; A) = 3S^ + (6A^ - 24A + 16)S + 3A'‘ - 24A^ + 65A^ - 68A + 24 

which is easily checked to be irreducible (using Maple’s factor command, for example), and hence 
is generically the minimal polynomial for It is easy to choose the right branch of the curve: 
the function a:(A) increases in (0,1) as A increases in (1,2), and is given in (14.131) as an explicit 
function of x that is easily checked to be increasing. It follows from these that tr^ increases from 0 
to 2/3 as A goes from 1 to 2. 

To finish describing the asymptotics, we first note that values of A greater than d are uninteresting. 
It is obvious that any prefix of S of length at least dk will allow all possible /c-subsequences to occur. 
Thus Qnk = when A > d. 


We already know that as A := n/fc ^ (d+1)/2 from below, the asymptotics are Gaussian and the 
exponential growth rate approaches d. For slopes A > (d+ l)/2, we use Theorem [3T6] to see that for 
each such A, there is a minimal point in the positive quadrant controlling asymptotics in direction 
A. The only minimal point of V we have not yet used is the double point (1,1/d). It is readily 
computed that this cone has extreme rays corresponding to A = (d + l)/2 and A = oo, and thus 
asymptotics in the interior of the cone will be supplied by the double point. Using Gorollarv 13.251 
we obtain a\k,k ^ d^. 


Although Section [3] did not discuss what happens when A approa ches bu t is not equal to (d+l)/2, 
some results are known. A refinement of Gorollarv 13.251 given in PW04 Theorem 3.1 (ii)], shows 
that asymptotics in the boundary direction A = (d + l)/2 are smal ler by a factor of 2 than in the 
interior of the cone. In fact an examination of the proof there (see PW04 Lemma 4.7 (ii)]) shows 
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that one has Gaussian behavior, 


Qnk ~ d^^{x) for n = ^ k + x\j k 

where $(a:) = (27r)“^/^ e“‘ dt is the standard normal CDF and the constant — 1)/12 

is obtained in a manner similar to Q. This can also be obtained from a probabilistic analysis as 
follows. The quantity d~^ank is the probability that a uniformly chosen sequence of length A: is a 
subsequence of S\n- The length of an initial substring of S required to contain a given sequence 
U 1 U 2 • ■ ■ Uk is — Uj-i) mod d where 0 mod d is taken to be d and where uq := 0. Thus the 

probability of a uniformly chosen word of length k being a subsequence of S']™ is equal to 



where Uj are independent uniform picks from {1,..., d}. The central limit theorem now gives 


$(a:). 



4.5. Paths, hills and Fine nnmbers. Dyck paths are paths from the origin whose steps are in 
the set {(1,1), (1,0), (1, —1)} and that never go below the a:-axis. In Section 17^ we will count Dyck 
paths by their final point, which may be anywhere in the bottom half of the first quadrant. Often in 
the literature, the term Dyck path is reserved for a path constrained to end on the a;-axis. In DS0l|, 
a number of combinatorial interpretations are found for these constrained Dyck paths when counted 
by final a;-value and by the number of hills, a hill being a peak of height 1 (a peak is an occurrence 
of the step (1,1) immediately followed by (1,-1)). Let a„fc denote the number of Dyck paths from 
the origin to (2n, 0) that have k hills. Of particular interest are the values a„o which count hill-free 
Dyck paths and are called Fine numbers. 


(4.14) 


The generating function for {unk} is derived in [DSOll . Proposition 4]: 

n k 2 v{x)/x 

F(x,y) := > ttnkX y = - , -= - 

\ UK y 1 -C ,/1 - 

n,k 

with 

v{x) = 


1 -I- 2a; -b a/ 1 - 4a: - 2xy 1 - yv{x) 
1 - VI-4a; 2a; 


3 — VI — 4a; 1 -I- VI — 4a; -I- 2a: 

Rationalizing the denominator, we have 

1 -\-2x — VI — 4a; 


(4.15) 


){x) = 


4-b2a; 

and thus the domain of convergence is given by |a:| < 1/4, \yv{x)\ < 1. For 1 < A < 00 the critical 
points are determined by 

4a: 


y,{v]x) = 


(1 — Vl — 4a;) (Vl — 4a;) (3 — Vl — 4a;) 


= A. 
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The left side of the above equality increases strictly from 1 at x = 0 to oo as x t 1/4, and there is 
a unique positive real solution xx for each A > 1. Figure [5] shows x increasing from 0 to 1/4 and v 
increasing from 0 to 1/3 as A increases from 1 to oo. 



Figure 5. x and v plotted against A 

We now want to complete the computations by simplifying (j^(v;x) at the value of x satisfying 
/i(u;x) = A. We sketch here how the computations of Section generalize to the algebraic case. 
The algebra is only a little more involved than it is for rational v but some changes are clearly 
required. 

In order to perform Grobner basis computations, we must first pass from (14.151) to the implicit 
form a(x, v) = 0 where a is a polynomial in Q[x, v\. It is easy in this case to see by inspection that 

a{v, x) = (2 + x)v^ — (1 + 2x)x + x ; 

at worst, implicitizing will require going back to the original derivation and following the computation 
at the level of ideals Q Next, according to (113 and (113, differentiating a implicitly gives // and 


^Implicitizati on techni ques for polynomial or rational parametrizations x = f{t),v 
may be found in [cLOOsL Sections 3.2 and 6.4]. 


g{t), though not needed here, 
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as rational functions of v and x: 
(4.16) 




V = — - 


V = -- 


H” Ck.^CXyy ^iCXxv^ix^V 




xa, 

va- 


— Ax 
2v(2 + x) 

Sw"* + 16 u^ — 16 a:^ 
4v^(2 + x)^ 
x(4x — v^) 


a = — 


2v^(2 + x) 

X [xvaluxx + xvalayy - 2xvaxyaxay + vc?yOtx + xc?x^^ _ -x{8x^ - 4x‘^v‘^ - Sxv"^ + v'^) 


af,v^ v^(2 + xY 

Setting fj, = n/k and clearing denominators gives a polynomial j3 = nvuy + kxax G Q[n, k] [x, u] that 
vanishes when fj,(y;x) = n/k. In our present example, we have 

I3{x, v) = {4n + 2nx + kx) + {—2 kx — n —2 nx) v + kx . 


To find the critical point corresponding to the direction n/k^ we could simply solve the equations 
a = 0,/3 = 0 for x,v. Then to simplify cr^, we add in the equation stating that cr^ = S' (with 
denominator cleared), and find a Grobner basis using the “plex” order with S as the last variable, 
just as in the previous section. Unfortunately, as is well known, such computations can be very slow, 
and in the present instance we have difficulty in obtaining an answer with Maple in a reasonable 
time. 


Thus we use the following alternative method, as described in CLO051 Section 2.2]. The poly¬ 
nomials a and [3 have finitely many common solutions {x,v) for any fixed n,k, so they define a 
zero-dimensional ideal, J. In other words, there are finitely many linearly independent monomials 
in X and v over Q[n, fc]/ (a,/3). A convenient choice is the set of monomials not divisible by any 
leading term of a fixed Grobner basis for J. With respect to this basis, one may express x and v as 
matrices over Q[n, k] for multiplication operators, then compute the matrix for cr^ as a rational func¬ 
tion of these matrices, and finally, find the minimal polynomial 7 for this matrix. The polynomial 
7 G Q[n, A:] will vanish at cr^. 


To carry this out in Maple, we use the following commands: 


sys:=[alpha, beta]: 

monomialorder := tdegCx, v): 

gb := Basis(sys, monomialorder): 

ns, rv := NormalSetCgb, monomialorder): 

Mx := MultiplicationMatrixCy, ns, rv, gb, monomialorder): 

Mv := MultiplicationMatrixCv, ns, rv, gb, monomialorder): 

Now evaluate the rational expression for given above, with x,v replaced by Mx,My (making 
liberal use of the simplification capabilities of Maple). We then compute the minimal polynomial 
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using the MinimalPolynomial command. The result is 
(32n^fc®) 

+z^ (—144n’^A;^ + 160n®fc^ — + An^k'^ + 2k^) 

+z (l44n® — 304n®/c + 203n^fc^ — 46n®fc^ — 15n^A:^ + 20n^fc® — llfc®n^ + lOn^A:^ — nfc®) 
+ (—ISn^A:^ + 31ri^A:‘^ — 27n® — 35n®fc^ — 21n^k^ + 57n^k + llk^n^ — ri^k’^) 


and computing eigenvalues (using solve) we obtain three possible values for cr^, namely 


52 

53 


(n — k){n + k){3n'^ + fc^) 

(9n — k + 3V9n^ — lOnfc + k^) (n — k)n 

4P 

(9n — k — 3V9n^ — lOnfc + /c^) (n — k)n 

4P 


Seeing which of these three gives the correct value for cr^ may be done similarly to the way it was 
in the previous section. Equation ()4.16|) gives as a univalent function of x and v. The correct 
branches for x and v as functions of A = n/k is shown in Figure [S] Plugging these into (14.161) and 
comparing to 5i, S 2 and S 3 (to avoid a numerical comparison, one may compare limits at 0) shows 
that second expression is correct: 


a" = 53 := 


(9n — k + 3\/9n'^ — lOnk + kS') (n — k)n 

AkP' 


To write ^(A) and u(A) in their simplest forms, observe that the values of x and v are sim¬ 
ply eigenvalues of the multiplication matrices Mx,My and so we obtain their respective minimal 
polynomials: 

4u^a:^ -I- (7n — k)(n — k)x — 2n{n — k ), 
and 2fcu^ -b 3(n — k)v — {n — k). 

Since we know that the relevant point has positive coordinates, we have the explicit forms 

(3n — k)^{n — k){9n — k) — {In — k){n — k) 

y/ {n — k){9n — k) — 3(n — k) 

V = — -. 

Ak 

As can be seen from this example, the computations can be rather messy, though still routine 
and automatable. In the end we have verified that cr^ > 0 for A > 1, and we conclude that Gaussian 
asymptotics given by Proposition 14.21 hold uniformly as n/k varies over any compact subinterval of 
( 1 , 00 ). 

Another way to simplify computations such as this one will be discussed in Section [SI Of course, 
for any given value of u, fc, we may shortcut the above computation and solve numerically for x 
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to obtain For example, with n/k = 2, we have x = (5vT7 — 13)/32 « 0.2379852541, so that 
y = 1/vix) = (3 + v^)/2 « 3.561552813. Hence 

a2k,k (0.1228255460 • • • )(4.957474791 • . 

As far as we know, asymptotics such as these have not previously been computed. Using 10 significant 
figure floating point approximations, we obtain (using Maple) an answer accurate to within 0.8% 
already for fc = 30. 

4.6. Horizontally convex polyominoes. A horizontally convex polyomino (HCP) is a union 
of cells [a, a + 1] x [ 6 , 6 + 1 ] in the two-dimensional integer lattice such that the interior of the figure 
is connected and every row is connected. Formally, if S' C and P = b)gs[a) a -I- 1] x [b,b+l] 
then P is an HCP if and only if the following three conditions hold: B := {b : 3a, (a, b) G S} is an 
interval; the set Af, := {a : (a, b) G S} is an interval for each b G Z; and whenever b,b + I G B, the 
sets Ai, and intersect. Let a„ be the number of HCP’s with n cells, counting two as the same 



Figure 6. an HCP with 13 cells and 4 rows 


if they are translates of one another. Polya P6169l | proved that 

a:(l — x)^ 


(4.17) 


^ a„a;" = 


1 — 5a; -I- 7x'^ — Ax^ 

n 

Fu rther d iscus sion of the origins of this fo rmula and its accompanying recursion may be found 


(4.18) 


Odl95l| and [Sta97|. The proof in [Wil94l . pages 150-153] shows in fact that 

xy{l — x)^ 

y = 

k 


F{x,y) = '^Qukx'^y'^ = 


(1 — x)'^ — xy(\ — X — x'^ + x"^ + x^y) 


where a„/c is the number of HCP’s with n cells and k rows. Let us find an asymptotic formula for 

CLps • 


All the coefficients of F{x, y) are nonnegative; they vanish when s > r but otherwise are at least 1. 
By Corollary 13.181 (the last part, which requires only Or > 0), we know that all points of V in the 
first quadrant that are on the southwest facing part of the graph (that is, that are minimal in the 
coordinatewise partial order) are contributing critical points. We do not know yet but will see later 
that there are no other critical points on each torus. As r varies over H = {r : 0 < s/r < 1} from 
the horizontal to the diagonal, the point Bcontribr moves along this graph from ( 1 , 0 ) to ( 0 ,oo). 

To make the mapping from r to z explicit, we use the fact that r G K(z) C L(z) lTheorem l3.1fil 
and part (i) of Proposition 13.141) . so z may be gotten from r by (13.51) . It is readily computed that 
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Figure 7. minimal points of V in the positive real quadrant 

Vi7 ^ 0 except at (1,0). Thus all minimal points are smooth. The only solution to G = H = 0 
is at (1,0), so the numerator is nonvanishing at any minimal point as well. Checking whether the 
quantity Q defined in equation (13. 9p of Theorem 13.211 ever vanishes, we find that the only solutions 
to H = Q = 0 are at (1,0) and at complex locations that are not minimal because L(z) is not real 
there iProposition 13.12)1 . Lastly, we must check that there are no contributing points other than 
the minimal points in the positive quadrant. We can ascertain that there are not by checking the 
“extraneous” critical points, which will lie on three other branches of a quartic (see below), and 
seeing that they lie on the wrong torus. We then know that the asymptotics for are uniform as 
s/r varies over a compact subset of the interval (0,1) and given by 

Urs . 

We will use Maple to determine x,y and C as explicit functions of A := s/r, giving asymptotics 
for the number of HCP’s whose shape is not asymptotically vertical or horizontal, but first we see 
what we may tell without much computation. 

A crude approximation at the logarithmic level is 

ars « exp [r(- log a; - {s/r) logy] 

where x and y of course still depend on s/r. We first compute the average row length of a typical 
HCP. We may apply Theorem [3221 to find the limit in probability of hk/k, where hk is the height of 
an HCP chosen uniformly from among all HCP’s of size k. Setting y = 1 in the bivariate generating 
function recovers the univariate generating function (14.171) . The point (a;, 1), where a; = Xq is 
the smallest root of the denominator of (14.171) controls asymptotics in this direction; we compute 
Mir{x, 1) there to be {xH^, yHy ){xo, 1), which simplifies to r/s = 4(5 —14a;o + 12a:Q)/(5 —9a:o + lla:o) 
or still further to a := ^(147 — 246a:o + 344a;Q) « 2.207. We conclude that for large k, k/hk a. 
Thus we see that the average row length in a typical large HCP is around 2.2. Finally, let us see 
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how the computer algebra for general r turns out. To find (x, y) given r, solve the pair of equations 
H = 0, sxHx = ryHy] this find points on V with f = (r, s) G L(a;, y). Explicitly, we set A = s/r and 
ask Maple for a Grobner basis for the ideal generated by H and XxH^ — yHy. The following Maple 
code fragment is useful in such cases. 

Hx := diff(H,x): Hy := diff(H,y): X:=x*Hx: Y:=y*Hy: 

Hxx := diff(Hx,x): Hxy := diff(Hx,y): Hyy := diff(Hy,y): 

Q := -X"2*Y-X*Y"2-x"2*Y~2*Hxx-X~2*y~2*Hyy+2*X*Y*x*y*Hxy: 

L := [H,lambda*X-Y]: 
gb := Basis(L, plex(y,x)): 

Maple returns a basis consisting of polynomials a and {x — 1)^/3, where the quartic 

/3 := (1 + A)a;'‘ + 4(1 + A)^^^ + 10(A2 + A - l)x^ + 4(2fc - + (1 - A)(l - 2A) 

is the elimination polynomial for x and is generically irreducible. Furthermore a is linear in y. 
Rather than express y in terms of x, it is perhaps easier to compute the elimination polynomial for 
y, which is y^ times the following polynomial: 

(dA-^ - 4A3 - SA^ + 4A - 1)/ + (dOA-^ - ddA^ - 20X'^ + 48A - 16)^^ 

+ (-172A4 + 128A3 + leOA^ - 256A + 64)y^ + (1152A^ - 1024A3 - 5l2X‘^)y - 1024A‘‘. 

Note that when H = 0, x = 1 if and only if ?/ = 0. The point (1,0) is a solution to the critical point 
equations for every value of A. However it is never a contributing point, because = oo when any 
coordinate vanishes (recall f is in a compact subset of the positive orthant). 

Thus generically we have 4 candidates for contributing points. Precisely one of these is minimal 
and in the first quadrant. The others do not contribute for generic r, which may easily be checked 
for any given r; a verification for all r simultaneously would require more computer algebra. 

Finally, from Corollary 13.211 we see that 

^ xy{l — x)^ Iy {—X {I — X — x'^ + x^ + x'^y) — x^y) 

V Q ■ 

The minimal polynomial for can be computed as in previous sections; it turns out to have 

degree 8 for generic A. Of course, given floating point approximations for x and y, we may simply 
compute an approximation for C directly. 

As an example, suppose that n = 2k so that A = 1/2. In this case we have simplification and the 
minimal polynomials for x and y respectively are Zx^ + 18a: — 5 and 75y^ — 288?/ + 256. Note that 
there is a single element 

^ 48 

\Iy~ ’ 25 




{xo,yo) ■■ 


(0.265986,1.397442) 
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of Bcrit in the positive quadrant. By Theorem 13.161 this belongs to ficontrib, so by Proposition 13.81 
Bcontrib is the singleton {(xo,?/o)}- We obtain (by naive floating point computation in Maple) 
On,ra /2 ~ (0.237305 ... )(3.18034... For n = 60, the relative error in this first order ap¬ 

proximation is about 1.5%. Contrast the exponential growth rate with the exponential growth rate 
3.20557 for the number of HCPs with n cells and any number of rows. 


4.7. Symmetric Eulerian numbers. The symmetric Eulerian numbers A{r, s) Com74 page 246] 
count the number of permutations of the set [r-bs-l-l] := {1,2,..., r-|-s-l-l} with precisely r descents. 
By reading backwards, this equals the number with exactly s descents, hen ce the s ymmetry in r and 
s. The symmetric Eulerian numbers have exponential generating function C.T04 2.4.21] 


(4.19) 


F{x,y) = 


_ pV 


xey — ye^ 


= E 


A{r,s) 


"I cl 


=: ^ QrsX^y'" 
r.s 


The denominator in this representation is singular at the origin and has a factor of (x — y) in common 
with the numerator. We factor this out of the numerator and denominator, writing F = G/H with 
G = (e^ — e*^)/(x — y) and H = (xe^ — yeA/{x — y)- In this representation, both the numerator and 
the denominator are entire functions. We quickly check that in the positive real quadrant, V is the 
graph of a monotone deceasing function as in the left-hand side of Figure [5] and that the quantity 
Q (see the Maple code fragment in Section l4l6l) does not vanish on V: 


first check that F is not entire but is analytic in a neighborhood of the origin; 

next, L’Hopital’s rule shows us that F[ never vanishes on {x = y} except at (1,1); 

rearranging terms then shows that F[ vanishes when x and y are two positive real numbers 

with the same height h{t) := t — logt (see the right-hand side of Figure [5]); 

the exact symbolic expression for Q has limit e^/12 at (1,1) and this is the minimum value 

of Q on V. 




Figure 8. the positive real points of V and their description by height-pairs 
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The symmetric Eulerian numbers are nonnegative, so we can find minimal points of V in the first 
quadrant. It is easy to check that the gradient of H never vanishes on V, so V is smooth. At the 
point X = y = 1, we have dH/dx = dH/dy = 1, so 6 dir(l, 1) = 1. For any other point of V, we may 
work with the representation of H having (x — y) in the denominator. We set 

xdH/dx 

xdH I dx + ydH / dy 

so that {x,y) S r f = (a, 1 — a). The expression for a does not look all that neat but on V 
we may substitute {yfx)e^ for e^, after which the expression for a reduces to (1 — x)/{y — x). We 
see therefore from Theorem 13.191 that A(r, s) is asymptotic to Ca{r + where for a 

given value of a = r/{r + s), the value of 7 is given by x~°‘y~^^~°‘'^ after solving the following 
transcendental equations for {x,y): 

(4.20) = a; 

y-x 

xe^ = ye^. 


We do not know whether there is a closed form expression for 7 in terms of a, though we note that 
by using (I4.20p several times, one may simplify somewhat: 7 = {y/x)°‘y~^ = exp(a{y — x))/y = 
exp(l - x)/y. 


The fact that the equations (I4.20p have a unique positive real solution is possible to verify directly, 
but also follows from nonnegativity of the coefficients and the fact that the positive real part of V lies 
along the boundary of the domain of convergence. One must still check that T(x(a), ?/(a)) contains 
no other points of V. This is true for generic f and can easily be checked for a given f, but a general 
proof that it works for all r is not obvious. 


4.8. Smirnov words. Given an integer d > 3 we define a Smirnov word in the alphabet {1 ,... ,d} 
to be a word in which no letter repeats consecutively. The number of Smirnov words of length n 
is of course easily seen by a direct counting argument to be d{d — 1)"“^. If we count these words 
according to the number of occurrences of each symbol, we get the multivariate generating function 


(4-21) F{z) = Y, «rz'' = -- 

1 - Li=i T+i- 

where Or is the number of Smirnov words with Xj occurrences of the symbol j for all 1 < j < d. 
This generating function may be derived from the generating function 


E{y) 


I 


for all words as follows. Note that collapsing all consecutive occurrences of each symbol in an 

arbitrary word yields a Smirnov word; this may be inverted by expanding each symbol of a Smirnov 

word into an arbitrary positive number of identical symbols, whence F {———,..., ———) = E(y). 

—I 1 - yi 1 - dd 

Substituting yj = Zj/{1 + Zj) yields the formula (14.211) for F; see [FSl . II.7.3, Example 23] for more 
on this old result. 


















When we write F as a rational function, we see that 
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H = Ei=o(l -i)ej(z) 


where ej is the elementary symmetric function of zi,..., and eg = 1 by definition. Thus when 
d = 3 for example, one obtains H = 1— {xy + yz + xz) — 2xyz. The expression for H may be denoted 
quite compactly by 


H = g-u 


du 


U—1 


where (/(z, u) := 0^=1 polynomial in u whose coefficients are the elementary symmetric 

functions in z. When d > 3, the denominator is of the form 1 — P for an aperiodic polynomial P 
with nonnegative coefficients, so by Proposition 13.171 ficontribjr will always consist of one strictly 
minimal point in the positive orthant. 


The typical statistics of a Smirnov word of length n are not in doubt, since it is clear that each 
letter will appear with frequency 1/d. This may be formally deduced from Theorem 13.281 which also 
shows the variation around the mean to be Gaussian. We may, perhaps, be more interested in the 
so-called large-deviation probabilities: the exponential rate at which the number of words decreases 
if we alter the statistics. Let s denote some frequency vector. Then as |r| ^ oo with f ^ s, we have 

1 ^ log Or ^ -r • log|z| 

where z G O'^ satisfies fidir(z) = r. 

To solve for z in terms of r, we use the symmetries of the problem. It is evident that Zj is 
symmetric in the variables {r^ : i ^ /}. For example, when d = 3, if we refer to z as {x^y,z) and f 
as (r, s,t), then the equations (j3.5p have the solution 

r^-js- tf 
2r(s + t — r) 

where the values of y and z are given by the same equation with r, s and t permuted. When d > 4, 
the solution is not a rational function and it is difficult to get Maple to halt on a Grobner basis 
computation. This points to the need for computational algebraic tools better suited to working 
with symmetric functions. 


4.9. Ali gnmen t s of sequences. The problem of sequence alignment is of interest in molecular 
biology Wat95l : |RT|, since a given string may evolve via substitutions, insertions, and deletions. 


We seek to place several strings of varying lengths in parallel, which may necessitate adding some 
spaces. A basic problem underlying design of algorithms which seek the best alignment in the sense 
of minimizing some score function is simply to count such alignments. Note that for this problem 
the elements of the string are irrelevant — we only care whether there is a letter or a space. 











50 


Mathematically, given positive integers k and a A:-tuple n = (ni,..., rife), we may define a {k; n)- 
alignment as a fc x n binary matrix for some n, such that no columns are identically zero and the 
rth row sum is Ui. Column j is aligned if it has no zero entry, and a block of size 6 is a fc x 6 
submatrix, with contiguous columns, all of which are aligned. For example, Figure [9] shows an 
alignment of 3 strings, and the corresponding matrix. The generating function giving the number 
of (fc, n)-alignments with the ith row having sum rii is given by 

. = 


wherep(z) wnl=i(l + rj). 


c 

c 

A 

G 

T 

G 

A 

G 

G 

T 
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T 
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Figure 9. A (3; 11,9, 9)-alignment and its corresponding binary matrix. 


The special case where all sequences involved have the same length (all rows have the same 
sum) is easily dealt with. By Proposition 13.171 there is a single strictly minimal point in the 
positive orthant that controls asymptotics in the diagonal direction. By symmetry of F and of 
the equation Qdir{z) = 1, this point has the form z = zl for some positive z. Thus we have 
z = 2^/^ — 1 and hence the asymptotic ha s the form C{z~^)^n 2 “. This result was derived by 
Griggs, Hanlon, Odlyzko and Waterman in GHOW90|| using a saddle point analysis. Note that for 
large fc, z~^ ~ 2“^/^fc''(log2)“'^. 


To compute the constant C, we use the formula of Theorem 13.201 It is readily computed that 
—ZkdF[/dzk = ^zl (1+z). To compute the Hessian determinant, we need only consider the Hessian in 
01 ,..., Qd-\ of — lognj<fc(l+'^62cp(*0j))+log(2—f([^^j,(l+zexp(*0j)). Direct computation using the 
fact that exp(j0j)) takes the value 2/(1+z) when 0 = 0 shows that the Hessian matrix has 

diagonal elements 2/(l + z) and off-diagonal elements l/(l + z). Thus Ti. = fc(l+ z)“^*^“^^ = k2^/^ 12 
and this yields 

2(i-fc^)/2fc 


C = 


(2i/fe _ l)Vfc7r^ 1 


Wat95 


This yields the result of GHOW90l| (note that there is an error in the displayed formula on 
p. 1989] - the factor should read r, and the factor 2*^^ -i)/(2fe) should be 2^^~^ 

A more important case for biological applications is when the min imu m block size is bounded 
below, by 6, say. The generating function in this case is shown in RTlj . by means of standard 
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operations on generating functions of formal languages, to have the form 

A 1-t + t^ 


F(zi,...,zk) = 


l + (l-p)A+(A-l)t (l-t)(l-g)-Fg 



where t = Uj 9 = P ~ ^ ~ t and A = 1 — t + t^. Note that when 6 = 1 then A = 1, and we recover 
the unrestricted case analysed above. In this case, asymptotics for the case where all sequences have 
equal length have been derived (as far as we know) only for the case fc = 2, using the “diagonal 
method” discussed in Section [H However, we can readily deal with general k using the methods of 
this paper. 


Again, by symmetry we need only look for a contributing minimal point of the form z = zl. 
Let H = 1 + (1 — p)A + {A — l)t. Since t = z^ and p = (1 + we must find the root(s) p of 
smallest modulus of h{z) := H{z, z,..., z). Note that the third formula above shows that we may 
take h = 1 — P where P is aperiodic with nonnegative coefficients, so there will be a unique root 
p of smallest modulus, and it will be positive real. The exponential growth rate is then p~^ with 
polynomial correction of order as above. 


We note that in the case k = 2, such a result was proved in [GHW86|, but stated slightly 
differently. The value r = is given as the minimal positive root of the polynomial (1 — x)'^ — 
4a:(l — X + x^Y- Setting x = z^ and factoring the difference of squares yields p as the minimal 
positive root of H{z, z) = 1 — 2z — z'^ + 2z^ — 


To better estimate p := pb, note that pb < 1 so it is reasonable to consider the approximation 
obtained by setting all powers involving z^ to zero. The minimal real zero pi of (1 — t)(l — g) is 
asymptotically of order 1/k (and equals 2^/^ — 1). Now pb > pi but pb should be close to pi. Indeed, 
it appears that an iteration scheme based on the fixed point equation 

Ijk 


Z= I + z'‘ 


I- Z^ 


^kb 


l-z^ 

given by h{z) = 0 converges rapidly to pb from starting point pi. 


- 1 


4.10. Probability that there is an edge in an induced subgraph. From the n-set [n] := 
{1,... ,n}, a collection of t disjoint pairs is named. Then a k element subset. S' C [n], is chosen 
uniformly at random. What is the probab ility p{n, fc, t) that S fails to contain as a subset any of the 
t pairs? This question is posed in LPS^| as a step in computing the diameter of a random Cayley 
graph of a group of cardinality n when k elements are chosen at random (the diameter is infinite if 
the k elements do not generate G). 


There are a number of ways of evaluating p{n, fc, t), one of which is by inclusion-exclusion on the 
number of pairs contained. This leads to 


(4.22) 


p{n,k,t) = 


i =0 


n — 2i 
k — 2i 


2=0 


n — 2i 
n — k 


-1 
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Here, we define the binomial coefficient to be zero unless 0 < q < p. 

The numbers a(n, fc,t) := {^p(n,k,t) are simpler to analyse via generating functions; since for 
most purposes only a first-order asymptotic is sought, we lose nothing in considering a(n, fc, t). Note 
that from the description above, a{n,k,t) = 0 ii k + t > n, hy the pigeonhole principle (since the 
complement of S has size less than t, S must contain at least t -f 1 of the 2t chosen elements). From 
the statement of the problem, a{n, k, t) is not defined if 2t > n; however, formula (14.221) still makes 
sense in that case, even though it does not define the probability of any event. In fact, a(n, fc, t) can 
be negative for large t. 


The most direct approach to finding a trivariate generating function of a(n, fc, t) is to use some 
well-known bivariate generating functions j ■ If ciij = C~j^) then the GF is (1 — a; — y)~^, 

while that for = (p is (1 — a;(l -I- y))~^■ We now compute 


, U 4 -i V / 


/n — 2 A 
\k — 2i) 


N,K,iJ 





{zwx^y'^yz^ 




X 



1 1 
1 — z{l + wx'^y'^) 1 — a:{l -|- y) ’ 
which yields the trivariate generating function 


(4.23) 


F(.x, y,z)=^ a(n, k, t)x'^y^z* 

n,k,t 


1 1 

1 — z(l — x^j/^) 1 — x{l + y) 


Note that if we impose the restriction 2t < n, then the sum over N,K is restricted to N > 2j. 
Now summing over N, K, t, i as above we obtain the restricted trivariate generating function 

1 1 


F2{x,y,z)= 


a{n, k,t)x'^y^z* = 


{n,fe,i: 2t<n} 

The advantage of F 2 is that all its coefficients are nonnegative. 


1 — a;(l -I- y) 1 — zx‘^{l + 2y)' 


We show how to give asymptotics for a(n, A:, t) in an arbitrary direction for which n > k + t. For 
the specific case of A: = [cnj ,t = (n — 4) /12 wi th 0 < c < 1, such an analysis was posed as an open 
question at the end of an early draft of |LPS~*~ |. 


It turns out that the computations can be carried out equally well with F or ^ 2 - However, with 
F 2 we can use Theorem 13.161 if we are only interested in the behaviour of o(n, k, t) for 2t < n. 

The denominator of F factors into two smooth pieces, call them Hi := 1 — x{\ + y) and H 2 := 
1 — z{l — x^y'^). There is a corresponding stratification of V into two surfaces, Vi := {Hi = 0 ^ H 2 } 
and V 2 = {H 2 = 0 y Hi}, and a curve, Vo := {Hi = H 2 = 0}. For z G Vi, lack of dependence on 
z means that fidzr(z) T ( 0 , 0 ,1), so Qdir cannot be in the strictly positive orthant; hence for r G O, 
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there are no points of ficritr in Vi. It turns out there are no points of ficritr in V 2 either. This is 
discovered by computation. If z € V 2 H Bcritjr and r = (r, s, I), then z satisfies the equations 

i?2(z) =0; 


dH2 dH2 

X— - rz —— = U ; 


dx 


dz 


dH2 dH2 

oy oz 

These equations turn out to have no solutions: Maple tells us in an instant that the ideal generated 
by the left-hand sides of the three above equations in C[a:, y, z, r, s] contains r — s; thus V 2 may 
contain points of ficritr only when r = s, that is, only governing asymptotics of a{n, k,t) for which 
n = k, which are not interesting. 


Evidently, ficritr C Vq. Two equations are Hi = H 2 = 0, that is, 

The last equation is that f is in the linear space spanned by Viogi?i and Viogi? 2 - Setting the 
determinant of (f, Viogi?i, ViogI? 2 ) equal to zero gives the equation 

(4.24) 2(r — s — l)y‘^ + {r — 3s)y — s = 0. 

Note that exactly the same equations are obtained for x,y,z when using ¥ 2 - 

If r > s -I- 1, the discriminant (r -|- s)^ — 8s of the quadratic in (14.241) is positive, and (14.241) has 
one positive and one negative root, the positive root being 

\/{r -\- sY — 8s — {r — 3s) 

4(r-s + I) ■ 


Plugging into for and z_|_ yields expressions for these which are also quadratic over Z[r, s]. 
The expression for (a:_, j/_, z_) is just the algebraic conjugate of (a:+, ?/+, z+), but is negative in the 
second and third coordinates. 


Having identified ficritr as these two conjugate points, which we will call z_|_ and z_, it remains 
to find ficontribr. Since the two elements lie on different tori, we may conclude from Theorem 13.161 
that ficontribr = {z+}- 


The form of the leading term asymptotic is then given by Theorem 13.261 

' k t' 


i(n, k,t) C 




For example, in the direction (12,12c, 1) with c < 11/12, we can compute the exponential growth 
rate and compare with that of and thereby show that certain random Cayley graphs have 

diameter 2 with very high probability. For details, see LPS~*~ |. 
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Analysis in the unrestricted case where we may have 2t > n is more difficult. In fact [x'^y^z*]F 
need not be zero when k +1 > n and 2t > n, and may be negative. 


4.11. Integer solutions to linear equations. Let Or be the number of nonnegative integer solu¬ 
tions to Ax = r where A is a d x m integer matrix. Denote by the column of A. Then 






k=l 


Sta97. Sec- 


This enumeration problem has a long history. We learned of it from |DLS03j| (see also 
tion 4.6]). One special case is when A = Am,n is the incidence matrix for a complete bipartite graph 
between m vertices and n vertices. Solutions to Ax = r count nonnegative integer m x n matrices 
with row and column sums prescribed by r; enumerating these is important when constructing sta¬ 
tistical tests for contingency tables. Another special case is when A = A„ is the incidence matrix of 
a complete directed graph on n vertices, directed via a linear order on the vertices. Here solutions 
are counted in various cones other than the positive orthant and the function enumerating them is 
known as Kostant’s partition function. 


It is known that Or is piecewise polynomial, and it has been a benchmark problem in computation 
to determine these polynomials, and the regions or chambers of polynomiality, explicitly. Indeed, 
several subproblems merit benchmark status. Counting the chambers for Kostant’s partition function 
is one such problem. Another is to evaluate the leading term for the diagonal polynomial a„i in the 
case where A = Ak^k, the so-called Ehrhart polynomial, which counts k x k nonnegative integer 
matrices with all rows and columns summing to n. Equivalently, this counts integer points in the 
n-fold dilation of the Birkhoff polytope, defined as the set in of all doubly stochastic k x k 
matrices; the leading term of Uni is the volume of the Birkhoff polytope. 

Our methods do not improve on the computational efficiency of previous researchers: our ending 
point is a well known representation, from which other researchers have attempted to find efficient 
means of computing. Our methods do, however, give an effective answer, which illustrates that this 
class of problems can be put in the framework for which our methods give an automatic solution. 
The remainder of this section is devoted to such an analysis. 

Our examination of the pole variety V begins with an answer rather than a question: we see that 
1 S V and realize that life would be easy if ficontribr = {1} for all r. In order for 1 to be a singleton 
stratum it is necessary and sufficient that the columns b^^^ span all of R'’*. For the remainder of the 
section we assume this to be the case. This satisfies our standing assumption 13.61 

The variety V is the union of the smooth sheets Vk, where for 1 < fc < m, Vfe is the binomial 

,(k) 

variety {z : Hk{z) := z° — 1 = 0}. On each of these varieties, Mivkiz) = Viog(iIfc)(z) is constant 
and equal to b^*). If a stratum S is bigger than a singleton, then at any z G S, the vectors ViogHk 
span a proper subspace of R'^; but these vectors do not vary as z varies in S, so the union over 
z S S' of L(z) is this same proper subspace of R"*. Consequently, the union over all points of all 
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non-singleton strata of L(z) is a union of proper subspaces, which we denote H'. For r ^ H', then, 
ficontribr consists of one or more singleton strata. 

Taking logs, we see that logVfc is a hyperplane normal to and is central (passes through 
the origin). We see that K(0) is the positive hull of the vectors this hull is H and outside 
of the closure of this, Or vanishes. Forgetting about the logs, we see that 1 S ficontribr for all 
r G Hq := H \ H': this follows from Theorem 13.231 since r G K(l). 

If the intersection of all the surfaces Vk contains any other points on the unit torus T(l) then 
these too are in ficontribr for all r G Hq. This is easy to check, since it is equivalent to the integer 
combinations of the columns of A spanning a proper sublattice of lA . For instance, in the example 
of counting matrices with constrained row and column sums, the columns of A span the alternating 
sublattice of lA, corresponding to the fact that ficontribr = {1,-1} for all r G Hq. 

There may be singleton strata given by intersections of subfamilies of {Vfe : 1 < fc < m}, that 
lie on the unit torus, but in computing the leading term asymptotics these may be ignored because 
they yield polynomials in r of lower degree. 

In summary, for any r G Hq, the leading term asymptotics are given by summing (I3.12p over a 
set containing 1, and isomorphic to the quotient of lA by the integer span of the columns of A. Our 
a priori knowledge that Or are integers, teamed with (13.111) as in the last part of Theorem 13.231 
shows that in fact are piecewise polynomial, at least away from H'. Thus, except on a set of 
codimension 1, we recover the well known piecewise polynomiality of a^. 


In their paper, de Loera and Sturmfels use as a running example the matrix 


A = 


10 0 11 
0 10 10 
0 0 10 1 


In this example we see that the columns of A span 1? over Z, so the only contributing point is 
1. The first three b vectors, in the order given, are the standard basis vectors, so the cone H is 
the whole positive orthant of RP^, which is a 2-simplex. The other two b vectors are two of the 
three face diagonals of this cone, which are two midpoints of edges of the 2-simplex in RP^. In 
addition to the boundary of H, there are three projective line segments in H', corresponding to the 
one-dimensional strata V4 n V5, V4 n V3 and V5 n V2. The complement of these three line segments 
(two medians of the simplex RP 2 and the line segment connecting two midpoints of edges) divides 
RP^ into five chambers, and on each of these chambers O y is a qu adr atic polynomial. Algorithms for 
computing these polynomials are given, for example, in 


nbsn.ll or [BP|, Section 5]. 


4.12. Queueing theory. Queuing theory describes the evolution of a collection of jobs that enter 
the system, exit the system and change types stochastically as they are worked on by one or more 
servers. There are many variations on this model. The example we consider here is a closed 
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network with no class-hopping, meaning that no jobs enter, leave or change type. The model is 
defined in terms of the parameters J,K,{Xj : 1 < j < l<fc< K}, and 

{Pj\ki : 1 < j < ■/, 0 < fc, ^ < K} as follows. 

Let J be the number of types of jobs. Let iL -|- 1 be the number of servers. In our example, 
server fc (1 < fc < K) distributes its resources evenly among all jobs in its queue, serving jobs of type 
j at rate Pjk/n when there are n jobs in its queue, while server 0 is an infinite server, which serves 
all waiting jobs simultaneously, serving a job of type j at rate Aj independent of how many jobs it 
is serving. A job of type of type j at a station fc G {0,... ,K} waits to be at the front of the queue 
(if k > 1), then waits to be served, then moves to server I with probability Pj-ki- All service times 
are independent exponential random variables and the movement between servers is Markovian and 
independent of the service times. 


The entire model is Markovian if one takes the state vector to be the J x {K -|-1) matrix Q = Q{t) 
giving the number of jobs of each type waiting at each server. Let nj = nj{t) = J2k=o Qjk{t) be the 
number of jobs of type j present in the system at time t. The assumptions of our model (closed, with 
no class-hopping) imply that the vector n = (ni,... ,nj) is constant. Assuming also that enough 
of the pj.^ki are nonzero that any job at any server can get to any other server, the process started 
from any n = n(0) will be an ergodic Markov chain on the set of states S'(n) of J x {K + 1) matrices 
with row sums n. There will be a unique stationary distribution 7 r„. Explicit product formulae exist 
for TTn{Q) of the form 7 rn(Q) = G^^n(* 5 )> where P is a large product and (^^(n) is the normalizing 
constant or partition function. Typically, is easier to estimate than G'(n), so much of the work 
in analyzing this sort of queueing network is in estimating G(n). 


It turns out that there is a relatively simple generating function for the quantities G(n). One 
must first derive quantities pji, which are stationary probabilities for a single job migrating through 
the network. The partition generating function is then given by 


(4.25) 


^(z) := = 


exp(zi -I-- \- Zj) 


rifci 

Th is equa tion is given in iKogOi Equation (44)] (though the definition of pji here is sketchy) or 


BM93L Equation (2.26)] for a slightly more general model (more than one infinite server). 


Evidently, the denominator of F decomposes into linear facto rs. A general analysis of this case, 
including an algorithm for identifying ficontrib, is provided in BP[. Here, we work the simplest 
nontrivial case, J = K = 2. Thus 

exp(a: -|- y) 


F{x,y) = 


(1 - piix- P 21 y){l - pi 2 X- P 22 y) 


for positive constants pji. This example is worked in BM93I] but without the infinite server. 


Assume that pn > pi 2 , which loses no generality except for allowing pn = pi 2 . The most 
interesting case is then p 22 > P 2 i- The singular variety V consists of two lines with negative slopes. 
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and we have assumed order relations implying that the lines intersect in the positive quadrant. This 
is exactly what is shown in Figure [3l 

To recall the information from that figure, let 

D ■= P11P22 — P12P21 

denote the determinant of the matrix [pji]. The set V has three strata, one being the point 

, ^ (P22- P2I Pll - Pl 2 \ 

(""■»> f— D—D— j 

where the lines meet and the others being the two lines, each with the point (xo,j/o) removed. 
Whenever a stratum is linear, there is a unique solution to the critical point equations (j3.5p on the 
stratum (use concavity of the logarithm to see this), hence Bcritf has cardinality 3 unless two of the 
critical points coincide. One of the critical points is always the point {xo,yo). Figure [TUI shows the 
location of all three critical points as r varies from near horizontal to near diagonal to near vertical. 





Figure 10. The three possible configurations of critical points 


The three configurations correspond to a division of the positive orthant of into three regions. 
Transitions between the regions occur when r is equal to 

^_ Pii{P22 - P21) 

P2i{pii — P12) 
or 

^_ Pl2{p22 — P21) 

P22{P11 — P 12 ) 

The middle region is the K(a:o,yo) of directions between Ai and A 2 . In the interior of this cone, 
asymptotics are given by Corollary 13.241 The determinant of the Hessian of the denominator of F, 
evaluated at {xo,yo), is equal to —D^. The corollary therefore yields 


(Xj-s 


— r —s 

Xq Vo 


exp(a;o + yp) 
xo yoD 


To be completely explicit, consider the denominator from Figure O Here pn = P 22 = 2/3 and 
P12 = P21 = 1/3. The boundaries of the cone have slopes 1/2 and 2. Both xq and yo are equal to 1, 
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SO for (r, s) ^ oo in a compact subcone of K(a:o, yo). 


ars - 1/3 -3e 


22.1671. 


The actual value for r = s = 80 is oso.so ~ 22.1668 and the the relative error is 1.7 x 10“®. Even 
at r = s = 40, the error is only one quarter of one percent. As in Section 14.41 there is a Gaussian 
scaling window at the boundary of K(a;o,?/o)- Thus, for example, a„, 2 n ~ |e^ « 11.083. The error 
decays more slowly on the boundary (order though our treatment here does not extend to 

this depth), so for instance 040,80 ~ 10.893 differs from the limiting value of |e^ by 1.7%. The 
standard deviation in the scaling window is a little under for example, when n = 80 the 

standard deviation is around 8. Checking 043,80 we see a value of roughly 18.45, which is 83% of 
the interior value of 3e^ while at two standard deviations above we get 055,80 ~ 21.49 or 97% of the 
interior value; at one and two standard deviations below we have 032,30 ~ 3.018 and 024,30 ~ 0.241 
which are respectively 13.6% and 1% of the interior value. 


5 . Transfer matrices 


Suppose we have a class C of combinatorial objects that may be put in bijective c orresp ondence 


with the collection of paths in a finite directed graph. For example, it is shown in Sta97l . Exam¬ 
ple 4.7.7] that the class C of permutations tt G K(j) ~ j | J the 

domain of tt is of this type. Specifically, the digraph has vertices V := {—1, 0,1}^ \ {(0, —1), (1,0)}, 
with a visit to the vertex (o, b) corresponding to j for which 7r(j) = a and 7r(j -|- 1) = 6; the vertices 
(0,-1) and (1,0) do not occur because this would require 7r(j) = 7r(j -|- 1), which cannot happen; 
the edge set E is all elements of except those connecting (1, x) to (j/, —1) for some x and y, since 
this would require 7r(j) = Tr{j + 2); the correspondence is complete because once + 1) and 

7r(j -I- 2) are distinct, no more collisions are possible. 

The transfer-matrix method is a general device for producing a generating function which 
counts paths from a vertex i to a vertex j according to a vector weight function W such that 
lT(ui,..., Vn) = ^27=1 w{vi,Vj) for some function w : V x V ^ N'^. 


Proposition 5.1 (Theorem 4.7.2 of Sta97|). Let A be the weight matrix, that is, the matrix defined 
by Aij = ^ where z = {zi,... ,Zd) and Sij denotes 1 if {i,j) G E and 0 otherwise. Let Cij 

he the subclass of paths starting at i and ending at j and define F{z) = j Then 

(_l)»+i det(/ — A : i,j) 


F(z) = ((/-A)-i)^,= 


det(/ — A) 

where {M : i,j) denotes (i, j)-cofactor of M, that is, M with row i and column j removed. 


□ 


Thus any class to which the transfer-matrix method applies will have a multivariate rational 
generating function. It is obvious, for example, that the transfer-matrix method applies to t he class 
of words in a finite alphabet with certain transitions forbidden. The examples discussed in Sta97l . 
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pages 241-260] range from restricted permutations to forbidden transition problems to the derivation 
of polyomino identities including (|4.17|1 . and finally to a very general result about counting classes 
that factor when viewed as monoids. Probably because techn iques f or extracting asymptotics were 
not widely known in the multivariate case, the discussion in Sta97j| centers around counting by a 
single variable: in the end all weights are set equal to a single variable, x, reducing multivariate 
formulae such as (14.1811 to univariate ones such as (I4.17|l . But the methods in the present paper 
allow us to handle multivariate rational functions almost as easily as univariate functions, so we are 
able to derive joint asymptotics for several statistics at once, which is useful to the degree that we 
care about joint statistics. 


5.1. Restricted switching. Some of the examples we have already seen, such as enumeration of 
Smirnov words, may be cast as transfer-matrix computations. A simpler example of the method is 
the following path counting problem. 

Let G be the graph on K + L + 2 vertices which is the union of two complete graphs of sizes 
K + 1 and L -I- 1 with a loop at every vertex and one edge xy between them. Paths on this graph 
correspond perhaps to a message or task being passed around two workgroups, with communication 
between the workgroups not allowed except between the bosses. If we sample uniformly among paths 
of length n, how much time does the message spend, say, among the common members of group 1 
(excluding the boss)? 


We can model this efficiently by a four-vertex graph, with vertices B 2 and C 2 , where B 

stands for boss (so x = Bi and y = B 2 ) and C refers to the workgroup. We have collapsed K 
vertices to form Ci and L to form C 2 , so a path whose number of visits to Ci is r and to C 2 is s 
will count for actual paths. 


Let F{u, V, z) = u^('^Fi)yN(u),c- 2 )^\u)\ N{uj, Cj) denotes the number of visits by uj to Cj 

and \uj\ is the length of the path uj. By Proposition l5.ll the function F is rational with denominator 
FI = det(/ — A), where 


A = 


and the states are ordered Ci, Ri, i? 2 , C' 2 - 
yields 


uz uz 0 0 

z z z 0 

0 z z z 

_ 0 0 vz vz _ 

Subtracting from the identity and taking the determinant 


00 do 

H = uz +UZV — uz — uzv + zv — 2z — ZV + 1 + z v -I- uz . 


Giving each visit to Ci weight K corresponds to the substitution u = Ku' and similarly, v = Lv'. 
Applying Theorem 1,8.271 and reframing the results in terms of the original u and v shows that the 
proportion of time a long path spends among Ci tends as n —> 00 to 

udH/du 
zdH/dz 


(K,L,zo) 
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where zo is the minimum modulus root of H{K,L,z). In the present example, this yields 

K{-z-zL+l + z^L-z^) 


-2Kz-2 KzL + K + A Kz^L -2zL + 2 + L-3z‘^L-3 Kz"^ 


for the proportion of time spent in Ci, where zq is the minimal modulus root of H{K, L, z). Trying 
if = 1, L = 1 as an example, we find that i?(l, 1, z) = l — 4:Z+2z‘^+3z^ — z^, leading to zq « 0.381966 
and a proportion of just over 1/8 for the time spent in Ci. There are 4 vertices, so the portion of 
time the task spends at the isolated employee is just over half what it would have been had bosses 
and employees had equal access to communication. This effect is more marked when the workgroups 
have different sizes. Increasing the size of the second group to 2, we plug in K = 1, L = 2 and hnd 
that zq ~ 0.311108 and that the fraction of time spent in Ci has plummeted to just under 0.024. 


5.2. Connector matrices. For a more substantial example, we have chosen a further refinement of 
the transfer-matrix method, called the connector matrix method. The specific problem we will 
look at is the enumeration of sequences with forbidden substrings, enumerated by the composition 
of the sequence. This is a problem in which we are guaranteed (by the general transfer matrix 
methodology) to get a multivariate rational generating function, but for which a more clever analysis 
greatly reduces the complexity of t he com putations. Our discussion of this method is distilled from 
Goulden and Jackson’s exposition |o.in4 


Let 5" be a finite alphabet and let T be a finite set of words (elements of S'"). Let C be the 
class of words with no substrings in T, that is, those (j/O) ■ ■ • j Un) S C such that {yj,..., yj+k) ^ T 
for all j,k > 0. We may reduce this to a forbidden transition problem with vertex set V '.= with 
k the maximum length of a word in T. This proves the generating function will be rational, but is 
a very unwieldy way to compute it, involving a |S|^ by |S|^ determinant. 


The connector matrix method of 


G08l| . as presented in [GJ04 pages 135-136], finds a much 


more efficient way to s olve th e forbidden substring problem (see also an elementary proof of these 
results via martingales Li80|). They discover that it is easy to enumerate sequences containing at 
least Vi occurrences of the forbidden substring Ui. By inclusion-exclusion, one can then determine 
the number containing precisely yi occurrences of oji, and then set y = 0 to count sequences entirely 
avoiding the forbidden substrings. 


To state their result, let T = {wi,..., Wm} be a finite set of forbidden substrings in the alphabet 
[d] := {1,..., d}. We wish to count words according to how many occurrence of each symbol 1,..., d 
they contain and how many occurrences of each forbidden word they contain. Thus for a word 77 , we 
define the weight T(r]) to be the d-vector counting how many occurrences of each letter and we let cr(ri) 
be the m-vector counting how many occurrences of each forbidden substring (possibly overlapping) 
occur in 77 . Let F be the (d -I- r?T,)-variate generating function with variables xi,..., Xd, yi, ■. ■ ,ym 
defined by 

F(x,y) 

V 
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Proposition 5.2 (Theorem 2.8.6 and Lemma 2.8.10 of G.T04| i. Given a pair of finite words, uj and 
to', let ficonnect(uj,ijj') denote the sum of the weight of a over all words a such that some initial 
segment (3 of to' is equal to a final segment of to and a is the initial unused segment of to. Formally, 


flconnect{uj,uj') = t'(ci) • 

q::(3/3,7) 

Let V be the square matrix of size m defined hy'Vij = ficonnecfiuji, LOj), denote the diagonal matrices 
Y := diag (j/i,..., L := diag ..., and let 3 to be the m by m matrix of ones. 

Then 

y) = [1 - (xi H- + Xd)- C{x, y - 1)]"^ 

where 

C(x,y) = trace ((I - YV)-^ YLJ) . 

In particular, setting y = 0, the generating function for the words with no occurrences of any 
forbidden substring is 

(5.1) F(x, 0) = [l - (a:i H-+ Xd) + trace ((I + V)“^LJ)] ^ . 


□ 


Remark. The function C is the so-called cluster generating function, whose (r,s)- coefficient counts 
strings of composition r once for each way that the collection of Sj occurrence of the substring coj 
may be found in the string. 


5.3. Forbidden substring example. Goulden and Jackson apply the results of the previous 
subsection to an example. Let S = {0,1} be the binary alphabet, and let T = {u;i,a; 2 } = 
{10101101,1110101}. The final substrings of length 1 and 3 of uii are initial substrings of uii with 
corresponding leftover pieces 1010110 and 10101. Thus Vn = + xfx^. Computing the other 

three entries similarly, we get 


V = 


/y.4 I ^2™3 ^3,.y.4 

X 2/v,4 ™ 0^3 I ry»2 ™2 ™4 

j ^ U /2 \ *^ 1*^2 *^2 *^ 1*^2 


We also obtain 


L = 


p3..^5 

61^2 

0 


0 

cfxl 


Plugging into m, denoting {xi,X 2 ) by {x,y), finally yields 

1 + x'^y^ + x'^y'^ + x^y'^ — x^y^ 


(5.2) 


F{x,y) = 


1 — X — y + x^y^ — x'^y^ — x^y'^ — x'^y° + 


- x'^y^ 


We make the usual preliminary computations on F. Write G and H for the numerator and 
denominator in (15.21) . A Grobner basis computation in Maple quickly tells us H has no singularities 
and that H = G = 0 does not occur in the positive quadrant of (the last command finds real 
roots of the relevant polynomial): 
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> Basis([H , diff(H,x) , diff(H,y)] , plex(x,y)); 

[ 1 ] 

> gb:=Basis([G,H],plex(x,y)) ; 

4 3 2 3 2 

[y + y - 2y + l,y + x- y + y +1] 

> fsolveCgb[1], y); 

-1.905166168, -0.6710436067 


Similarly, we see that Q and H vanish simultaneously only at (1,0) and at (0,1). Using our a 
priori knowledge that the coefficients of F are nonnegative, we apply Corollary 13.181 and look for 
minimal points on the lowest arc of the graph of V in the first quadrant. The plot of this is visually 
indistinguishable from the line segment x + y = 1, which is not surprising because the forbidden 
substrings only affect the terms of the generating function of order 7 and higher. More computer 
algebra shows the point {x,y) G ficontribr to be algebraic of degree 21. 


One question we might ask next is to what degree the forbidden substrings bias the typical word 
to contain an unequal number of zeros and ones. One might imagine there will be a slight preference 
for zeros since the forbidden substrings contain mostly ones. To find out the composition of the 
typical word, we apply Theorem 13.281 Setting y = x gives the univariate generating function 


f{x) = F{x,x) = 


1 


■ x^ + x"^ — x^ 


1 - 2a;- 


^10 


= ^iV(n)a 


for the number N{n) of words of length n with no occurrences of forbidden substrings. The root of 
the denominator of minimum modulus is xq = 0.505496 ..., whence N{n) ~ C(l/a;o)”. 


The direction associated to the point {xo,Xo) is given by the projective point 


(5.3) 


r = 


f dH dH\ 


(a;o,a;o) . 


The ratio Aq = r/s of zeros to ones for this r is a rational function of xo- We may evaluate xq 
numerically and plug it into this rational function, but the numerics will be more accurate if we reduce 
algebraically first in Q[a;o]. Specifically, we may first reduce the rational function to a polynomial 
by inverting the denominator modulo the minimal polynomial for xq (Maple’s gcdex function) then 
multiplying by the numerator and reducing again. Then, from the representation Aq = P{xo), we 
may produce the minimal polynomial for Aq by writing the powers of Aq all as polynomials of degree 9 
in Xq and determining the linear relation holding among the powers Aq, ..., Xq^. We find in the end 
that Aq = 1.059834.... Thus indeed, there is a slight bias toward zeros. 
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Suppose we wish to know how long a string must be before the count of zeros and ones tells 
us whether the string was sampled from the measure avoiding 10101101 and 1110101 versus the 
uniform measure. We may answer this by means of the local central limit behavior described in 
Theorem 13.281 We may verify that the proportion of zeros is distributed as 

' + cn"^/^iV(0,1) 

Ao + i 

where iV(0,1) denotes a standard normal. Once we have computed the constant c, we will know 

how big n must be before -—-and the count of zeros and ones will tip us off as to 

Ao + 1 2^ 

which of the two measures we are seeing. 


6. Lagrange inversion 


Suppose that a univariate generating function f{z) satisfies the functional equation f{z) = 
z4>{f{z)) for some function (j) analytic at the origin and not vanishing there. Such functions of¬ 
ten arise, among other places, in graph and tree enumeration problems. If (/> is a polynomial, then 
/ is algebraic, but even in this case it may not be possible to solve for / explicitl y. A b etter way to 
analyse / is via the Lagrange inversion formula. One common formulation states GJ04I . Thm 1.2.4] 
that 

(6.1) n/(z) = i 

n 

where [y"] denotes the coefficient of ?/”. 

To evaluate the right side of (16.HI . we look at the generating function 

OO 


1 - x(l){y) 


n—0 


which generates the powers of (j). The ^ term of this is the same as the y" ^ term of of (/)(y)". 

In other words, 

( 6 . 2 ) 


[z"]/(z) = i [x"yT . . . 

n 1 — x(j)[y) 


This is a special case of the more general formula for 'i/?(/(z)): 
(6.3) 


z-miz)) = - [x-y-] 

n 1 — X(/)(y) 


These formulae hold at the level of formal power series, and, if i/; and (j) have nonzero radius of 
convergence, at the level of analytic functions. 


We can now apply the analysis leading to (14.101) (taking note that the series v{x) in that equation 
is presently called (j){y), while the series (j){x) there is here called yip'{y)). Assume </) has degree at 
least 2 - the easy case (j){z) = az + b may be handled directly. 
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Recall the definitions of n and cr^ from Subsection 14.31 We are interested only in coefficients 
of x"?/", that is, the diagonal direction. Set := y(l>'/(j) equal to 1; geometrically, we graph 

(j){y) against y and ask that the secant line from the origin to the point (y, (j){y)) be tangent to the 
graph there. Letting yo denote a solution to this, we then have a point {l/(j){yo),yo) in the set 



Bcriti and at this point the quantity is equal to (^(yo)/2/o)” = equations (|4.5I) 

and (14.61) we have y(0;yo) = 1 and consequently cr^(^;yo) simplifies to yo(t}"{yo)/4’iyo)- Putting this 
together with the asymptotic formula (I4.10p . setting r = s = n and simplifying leads to the following 
proposition. Note that it is easily shown that / is aperiodic if and only if (j) is. 


Proposition 6.1. Let (j) he analytic and nonvanishing at the origin, aperiodic with nonnegative 
coefficients, with degree at least 2 at infinity. Let f be the nonnegative series satisfying f{z) = 
z4>{f{z)). Let y = yo he the positive solution of yip'(y) = 4>{y). Then if tp has radius of convergence 
strictly greater than yo, we have 


(6.4) 


[z'^miz)) 

k>0 


where bo = 


ypip'iyo) 

^2-K<p"{yo)/<p{yo) 


□ 


A variant of Proposition 16.11 proved by other means, is found in FSl . Thm VI.6]. That result, 
proved by univariate methods, is stronger than Proposition 16.11 in some ways. For example, it can 
handle the estimation of [z''']T{z){l — T{z))~‘^ where T{z) = zexp{T{z)), which occurs in the study 
of random mappings, whereas an attempt to use Proposition [6T] directly runs into the problem that 
yo = 1 and ip{z) = (1 — has radius of convergence 1. 


6.1. Bivariate asymptotics. We now discuss bivariate asymptotics. In the special case fiiy) = y^, 
for fixed k, the above results on Lagrange inversion yield the first order asymptotic 

7 k 

[z"]/(z)'= ~ -</.'(yo)"^= 

n yj2TTn(p"{yo)/(p{yo) 
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where yo(l}'{yo) = <t>'{vn) - We c an also derive an asymptotic as both n and k approach oo. We sketch 
an argument here (see Wil05l | for details). The formula (j4.10|) supplies asymptotics whenever n/k 
belongs to a compact set of the interior of (A, B), where A is the order of / at 0 and B the order at 
oo. In the present case we have A = 1 and B = oo. We may then use the defining relation for /(z) 
to express the formula in terms of (j) only, leading to the following result. 

Proposition 6.2. Let (j) be analytic and nonvanishing at the origin, with nonnegative coefficients, 
aperiodic and of order at least 2 at infinity. Let f be the positive series satisfying f{z) = z4){f{z)). 
For each n, k, set X = k/n and let y = y\ be the positive real solution of the equation y) = (1 —A). 
Then 

1 


(6.5) 


n^,k—n 


[z-]f{zY ^ Xf^iyxTyl 


^y2TTna‘^{(j); yx) 


= A(1 - X)-^<f'{yxr 


vi 


^/2TTna^{(i)iyx) 


Here y and are given by equations EIP and respectively. The asymptotic approximation 

holds uniformly provided that A lies in a compact subset of (0,1). □ 


The similarity between (j6.4p and (16.5p seems to indicate that a version of Proposition 16.21 that 
holds uniformly for 0 < k/n < 1 — e should apply. This is consistent with a result of Drmota Drm94| 
but is seemingly more general, and wa rrants further study. Such a result can very likely be obtained 
using th e extension by Llad ser Lla03l| of the methods of PW02j | , as can results of Meir and Moon 


MM90|| and Gardy |Gar95l | related to Proposition 16.21 


6.2. Trees in a simple variety. We now discuss a well-known situation FSl . VII. 2] in which the 
foregoing results can be applied. 


Consider the class of ordered (plane) unlabelled trees belonging to a so-called simple variety, 
namely a class W defined by the restriction that each node may have a number of children belonging 
to a fixed subset fl of N. Some commonly used simple varieties are: regular d-ary trees, LI = {0, d}; 
unary-binary trees, = {0,1, 2}; general plane trees, fl = N. The generating function f{z) counting 
trees by nodes satisfies f{z) = zuj{f{z)) where uj{y) = 

Provided that uj is aperiodic. Proposition 16.11 applies. The form of uj shows that the equation 
yu}'{y) = w(y) always has a unique solution strictly between 0 and 1. As a simple example, we 
compute the asymptotics for the number of general plane trees with n nodes (the exact answer 
being the Catalan number C„_i). The equation y4)'{y) = (f>{y) has solution y = 1/2, corresponding 
to (j>'{y) = 1/4. Thus we obtain 

n/(^)-^4 

in accordance with Stirling’s approximation applied to the expression of Cn-i in terms of factorials. 

Proposition l6.1l does not directly apply to the case of regular d-ary trees. Rather, we use a version 
where the contribution from several minimal points on the same torus must be added. The details 
are as follows. The equation y) = I has solutions ujp where p = {d— 1)“^/'^ and = 1. Each 
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of these is a contributing critical point and the corresponding leading term asymptotic contribution 
is where C = p^{2TT{d— 1))“^/^ and a‘^ = 1)'^“^). Thus the asymptotic 

leading term is 0 unless d divides n — 1, in which case it is dCa'^’nT^!'^. 


We note in passing that one can avoid the last computation as follows FS|, 1.5, Example 13]. 
There is a bijection between the class of d-ary trees and the class C of trees with vertices of degree 
at most d but allowing for types of nodes of degree j. The pruning map removes all external 
nodes (nodes of degree 0) from a d-ary tree, then labelling each node of the pruned tree with the 
set of children that were removed. Pruning always maps a tree of 1 -I- dm nodes to a tree of m 
nodes. The extension of the above asymptotics to a multiset of degrees (allowing for different types 
of children) is straightforward. Thus we can compute using the degree enumerator g{z) = (1 H- z)‘^ 
of C in ProDOsition l6.ll and [z^]f{z) = [z‘^]g{z). 

Now we consider the mean degree profile of trees in W. Let ^k{t) be the numbe r of nodes of 
degree k in the tree t, and \t\ the total number of nodes in t. A standard calculation FSl . VII.2.2, 
Example 5] shows that the cumulative generating function is 

tew 

where (pk = [y^](piy)- Thus we have 

F{z, m) ^ ^ = /r(/; z)z(p(uf{z)). 

k>otew 

The mean number of nodes of degree fc in a uniformly randomly chosen tree of size n from W is 
then given by 


Mnk = 


'^]F{zp, 


[z"]/(z) 

Consider again the simplest case, general plane trees, with (j){y) = (1 — y)~^. Then F{z,u) corre¬ 
sponds to a Riordan array. A simple variant of Proposition 16.21 shows that 


[z-u^]F{z,u)^y'^<P'{y) 


y'2TYna'^{(j)]y) 


where y) = 1 — k/n. This is easily solved to obtain y = (n — k)/{2n — k) and hence we obtain 
routinely: 


( 6 . 6 ) 


Fdnk ~ kL 


r, , \ 2n —2 / 1 \ k 

2n — k \ { n — k 


2n 


2n — kJ y 2(n — k)(2n — k) 


uniformly as long as k/n is in a com pact subset of (0,1). The mean number of leaves (nodes of degree 
0) in such a tree is well known FSl . III.5, Example 12] to be n/2, which is obtained by substituting 
fc = 0 in the right side of (ES]). Thus it again seems likely that the approximation is in fact uniform 
on [0,1 — e]. 
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6.3. Cores of planar graphs. A rooted planar map is a graph with a distinguished edge (the 
root) that can be embedded in the plane. A rooted planar map is completely specified by a planar 
graph, a distinguished edge, and a cyclic ordering of edges around each vertex. The core of a 
map (henceforth always a rooted planar map) is the largest 2-connected subgraph containing the 
ro ot edge. The pro blem of the typical core size (cardinality of the core) of a map is considered 


GW 99 ; 


BFSS0l|. They obtain a functional equation for the generating function 


M{u,z) = 


UnhZ U 


where an,k is the number of maps with n edges and core size k (that is, the core has cardinality 
k). They are interested in computing the probability distribution of the core size of a map sampled 
uniformly from among all maps of with n edges. Applying the general inversion formula (16.3p to 
their functional equations, they are able to compute 

(6.7) p{n, k) := cj^izT 

l^j an,j n 

where 'tp{z) = (z/3)(l — z/S)"^ and (j){z) = 3(1 + z)^ are the operators that arise in the functional 
equation for M{u, z). 


The mean size of the core was computed in GW 99 l |. For this it sufficed to prove a specialized 
result about coefficients of powers of functions asymptotic to (1 — as z ^ 1 . The problem of 

finding a limit law for the distribution about the mean was taken up in BFSS0l|. One may arrive 
directly at an asymptotic formula for p{n, k) if one rewrites (16.71) as 

p{n,k) = - a;'=j/”z” --—. 

n [I - x'il!{z))(l - y(j){z)) 

The analysis of the resulting trivariate generating function is, however, q uite cha llenging. In partic¬ 
ular, there is a point where Q vanishes; a generalization of Theorem 13.191 PW02L Theorem 3.3] tells 
us that the asymptotics in this precise direction, but does not answer the more interesting question 
of asymptotics in a scaling window near that direction, which turns out to be k = n/3 ± An 

answer, in the framework of PW02|, is gi ven in the doctoral work of M. Lladser Lla03l Section 5.5]. 
A complete answer is given in BFSS0l| by reductions to a one-variable generating function, on 
which a coalescing-saddle approximation is used. We will not go into details here. 


7. The kernel method 

The kernel method is a means of producing a generating function for an array {or : r € of 
numbers satisfying a linear recurrence 

(7.1) Ur — ^ ^ CgUr—s ■ 

seE 

Here if is a finite subset of \ {0} which is not necessarily a subset of but whose convex 
hull must not intersect the closed negative orthant. The numbers {cg : s G B} are constants and 
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the relation (17.ip holds for all r except those in the boundary condition, which will be made 
precise below. As usual, let F{'z) = ^ OrZ''. In one variable F is always a rational function, but in 
more than one variable the generating function can be rational, algebraic, D-finite, or differentially 
transcendental (not D-finite). A classification along these lines, determ ined mo re or less by the 
number of coordinates in which points of E can be negative, is given in BMPOOj, which is a very 
nice exposition of the kernel method at an elementary level. 


We are interested in the kernel method because it often produces generating functions to which 
Theorem 13. 191 may be applied. Because t he method is not all that well known, we include a detailed 
description, drawing heavily on BMP00|. We begin, though, with an example. 


7.1. A random walk problem. Two players move their tokens toward the finish square, flipping 
a fair coin each time to see who moves forward one square. At present the distances to the finish 
are 1 -(- r and 1 -I- r -b s. If the second player passes the first player, the second player wins; if the 
first player reaches the finish square, the first player wins; if both players are on the square before 
the finish square, it is a draw. What is the probability of a draw? 


Let ars be the probability of a draw, starting with initial positions 1 +r and l-l-r-l-s. Conditioning 
on which player moves first, one find the recursion 

_ — l “t“ Clr—l,s+l 

^ 

which is valid for all (r, s) > (0,0) except for (0,0), provided that we define a^s to be zero when 
one or more coordinate is negative. The relation ars — (l/2)ar,s-i — (l/2)ar-i,s+i = 0 suggests 
we multiply the generating function F{x,y) := J2^rsX^y^ by 1 — {l/2)y — {\/2){x/y). To clear 
denominators, we multiply by 2y: define Q{x, y) =2y — y^ — x and compute Q • F. We see that the 
coefficients of this vanish with two exceptions: the x^y^ coefficient corresponds is 2ao,o~ao,-i~0-i,i 
which is equal to 2, not 0, because the recursion does not hold at (0,0) (ago is set equal to 1); the 
y^x^ coefficients do not vanish for j > 1 because, due to clearing the denominator, these correspond 
to 2aj^-i — aj ,-2 ~ cij-ifi. This expression is nonzero since, by definition, only the third term is 
nonzero, but the value of the expression is not given by prescribed boundary conditions. That is, 
we have 


(7.2) 


Q(x, y)F{x, y) = 2y- h{x) 


where h{x) = = xF(x,0) will not be known until we solve for F. 


This generating function is in fact a simpler variant of the one derived in LL99l| for the waiting 
time until the two players collide, which is needed in the analysis of a sorting algorithm. Their 
solution is to observe that there is an analytic curve in a neighborhood of the origin on which Q 
vanishes. Solving Q = 0 for y in fact yields two solutions, one of which, y = ^{x) := 1 — \/l — x, 
vanishes at the origin. Since ^ has a positive radius of convergence, we have, at the level of formal 
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power series, that Q{x,^{x)) = 0, and substituting ^{x) for y in (17.211 gives 

0 = Q(x, ^(x))F(x, C(x)) = 2^(x) - h{x). 

Thus h{x) = 2^{x) and 


F{x,y) = 2 


y-i{x) ^ 1 

Q{x,y) l + y/l-x-y 


As is typical of the kernel method, the generating function F has a pole along the branch of 
the kernel variety {Q(x,y) = 0} that does not pass through the origin. The function F is not 
meromorphic everywhere, having a branch singularity on the line x = 1, but it is meromorphic 
in neighborhoods of polydisks T){x,y) for minimal points (x, 1 + — x) on the pole variety for 

0 < X < 1. For 0 < X < 1, Mir{x, 1 + Vl ~ x) = (2^/1 — x + 2 — 2x)/x. If we set this equal to A 
and solve for x we find x = 4(1 + A)/(2 + A)^ and 1 + y'l — x = (2 + 2A)/(2 + A). In other words. 


ficontribr = 

where X = s/r. Plugging this into (|3.6I) gives 


4(1 +A) 2(1 +A) 
(2 + A)2’ 2 +A 




2 (r + s) 
2r + s 


C (2r + s)2’'+« 

“ 22r+s _|_ s)(r-+s) ' 

One recognizes in this formula the asymptotics of the binomial coefficient and indeed the 

binomial coefficient may be obtained via a combinatorial analysis of the random walk paths. 


7.2. Explanation of the kernel method. Because of the applicability of Theorem 13.191 to gen¬ 
erating functions derived from the kernel method, we now give a short explanation of this method. 
We adopt the notation from the first paragraph of this section. 

Let p be the coordinatewise inhmum of points in EU {0}, that is the greatest element of such 
that p < s for every s G E U {0}. Let 

Q(z) := z-P [ 1 - ^ c.z^ 

V s&E 

where the normalization by z^p guarantees that Q is a polynomial but not divisible by any Zj. 
Partition into two sets, Z and B, and assume that the relation dD holds for all r G Z; the set B 
must be closed under < and the values {or ; r G B} are specified explicitly by constants {b^ : r G B} 
rather than by jni). 

If B C then p = 0, B can be arbitrary, B is a function of the form G/Q, and G is rational 
if the boundary conditions are rational. The analysis in this case is straightforward and the kernel 
method yields only what may be derived directly from the recursion for in terms of {og : s < r}. 
For examples of this, see Sections 14.11 and 14.21 We concentrate instead on the case where d = 2 and 
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the second coordinate of points in E may be negative. It is known in this case BMPOfi Theorem 13] 
that if the generating function for the boundary conditions is algebraic then F is algebraic. On the 
other hand, we shall see that an outcome of the kernel method is that F will have a pole variety, 
and will usually satisfy the meromorphicity condition in the remark after Theorem 13. 161 


To apply the kernel method, one examines the product QFz, where for convenience we have let 
Fz '■= “rZ*' be the generating function for those values for which the recursion (|7.1D holds. If 

we assume that the generating function F^ '■= for the prescribed boundary conditions is 

known, then since F = Fz + Tb, finding Fz is equivalent to finding F. 


There are two kinds of contribution to QFz- Firstly, for every pair (r, s) with s G E, r G Z and 
r — s G B, there is a term Cg&r-sZ'’^^: the corresponding coefficient of QF vanishes, but Fz has 
been stripped of the boundary terms that cause the cancellation. Let 


K{z) := ^ Cs6r-sz" P 


denote the sum of these terms. The “iF” stands for “known”, because the coefficients of K are 
determined by the boundary conditions, which are known. Secondly, for every pair (r, s) with 
s G E,r — s G B and r ^ Z there is a term —due to the fact that the recursion does not 
hold at r. 

C/(z) = ^ Csar-sZ'’”P 

r—sGZ,sGE,r^Z 

denote these terms. The “U” stands for “unknown”, because these coefficients are not explicitly 
determined from the boundary conditions. It is not hard to show that for any dimension d and any 
E whose convex hull does not contain a neighborhood of the origin, the following result holds. 


Proposition 7.1 ( BMPOOl Theorem 5]). Let E, {cg}, p, B, Z, {bg : s G B}, Fz and K be as 
above. Then there is a unique set of values {a^ ■ r G Z} for such that |y.l[ ) holds for all r G Z. 
Consequently, there is a unique pair {F, U) of formal power series such that 


QFz =K-U. 


□ 


In other words, the unknown power series U is determined from the data along with Fz- Another 
way of thinking about this is that Fz is trying to be the power series K/Q but since Q vanishes 
at the origin, one must subtract some terms from K to cancel the function Qq where Q = QoQi 
an d Qn con sists of the branches of Q passing through the origin. The kernel method, as presented 


BMP0nl | turns this intuition into a precise statement. 


Proposition 7.2 ( BMPOOL equation (24)]). Suppose d = 2, 
be exactly p formal power series fi,... ,fp such that Cj(0) = 


p = (0, —p) and Fb = 1. There will 
0 and Q{x,f,j{x)) = 0, and we may 
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write Q{x, y) = —C{x) 0^=1 iv ~ YVj=i{y ~ Pji^)) Z®’’ some r and pi,..., pr- The generating 

function Fz will then be given by 

K{x,y)-U{x,y) _irj=iiy - ^jix)) _ 1 

Q{x,y) - Q{x,y) ~ -C{x)Y{]^^{y-p,{x))- 

□ 


We turn to some examples. 

7.3. Dyck, Motzkin, Schroder, and generalized Dyck paths. Let i? be a set {(ri, si),..., (r^, Sfe)} 
of integer vectors with rj > 0 for all j and min^- Sj = —p < 0 < maxj Sj = P. The generalized Dyck 
paths with increments in E to the point (r, s) in the first quadrant are the paths from (0, 0) to (r, s), 
with increments in E, which never go below the horizontal axis. 



Figure 11. a generalized Dyck path of length nine with E = {(1,2), (1,-1)} 


Let E(x, y) = s ^rsX^y'^ where a^s is the number of generalized Dyck paths to the point (r, s). 
We will have B = {(0,0)}, Fb = 1, Q{x,y) = yP{l - Y,, x'^' y'"'), and C{x) = Yi:si=p ■ We now 
discuss three well known instances of such paths taken from BMPOC |. 


Dyck paths 


Motzkin paths 


Schroder paths 
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Figure 12. legal steps for three types of paths 


Dyck paths: When E = {(1,1), (1, —1)} we have the original Dyck paths. We have Q{x,y) = 
y — xy'^ — x. Here C{x) = x, and Q{x, y) = —x{y — ^{x)){y — p{x)) where ^{x) = (1 —Vl — 4:x'^)/{2x) 
and p{x) = (1 + a/I — 4a;^)/(2a:) is the algebraic conjugate of Note that p is a formal Laurent 
series and = 1. 


Thus we have, following the discussion above, 


F{x,y) 


1 

-x{y- p{x)) 


^{x)/x 
1 - y^{x )■ 


Setting y = 0 recovers the fact that the Dyck paths coming back to the x-axis at (2n, 0) are counted 
by the Catalan number Cn- 


Asymptotics are readily obtained either using the explicit or implicit form of ^ (noting the peri¬ 
odicity of ^). Let us use the implicit form in this example, since we will be illustrating use of the 
explicit form below in the case of Schroder paths, where Lagrange inversion does not apply. The 
vanishing of Q{x, y) = y — xy'^ — x occurs when y = X(p{y) with (j){y) = l + y'^. Thus ^ = X(p{^). We 
use a slight variant of (|6.5I) . namely 




1 

^2TTna'^{(j);yx) 


where X = k/n and p{(j); y\) = 1 — A. 


The chain rule yields p{(t)]y) = y(t)y/4> = TJft’tty/fl) = (j)) = + ^t) with 

t = Thus we solve 1 — A = 2t/(l -I- t), or = (1 — A)/(l -|- A) = (r — s)/(r -|- s). The 
two contributing points y\ cancel out if r — s is odd and reinforce if r — s is even. We compute 
</'(j/a) = 2/(1 -|- A) = 2r/(r -I- s) and a‘^{(j)\y\) = 1 — A^ = (r^ — s^)/r^. We obtain 


provided r 


CLrs 


^s/2r\’''''^/r-sA^ ^ 

r\r + s) \r + s) y^27r(r - s)(r -h s) 
2s E a/t 

(r + s) (i^)^(l^)^ 


s is even, and 0 otherwise. This is uniform for 0 < 6 < s/r < 1 — e< 1. 


Of course, in this simple example Lagrange inversion also gives an exact formula involving binomial 
coefficients, namely 


(7.3) ars = K+^]e(x)^+^ = 


r -I- 1 


= 


s -I- 1 


1 


^ + 1 V(^ - ■s)/2. 


2(s + 1) r! 
r-hs-f 2 (i:^)!(^:^)! 


when r — s is even, and 0 otherwise. If we had instead computed using (16.51) . we 

would have obtained a correct leading order asymptotic of a slightly different form (r, s replaced by 
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r + 1, s + 1 in some places). In each case these asymptotics are consistent with what is obtained by 
applying Stirling’s formula to the factorials in (j7.3ll . 


Motzkin paths: Let if = {(1,1), (1,0), (1, —I)}- In this case the generalized Dyck paths are known 
as Motzkin paths. Again we have case Q{x,y) = y — xy^ — x — xy. Now p and ^ are given by 
(1 — a; ± \/\ — 2x — 2>x'^)/{2x) and again 

F(x ) = ^_ 

This time ^ is given implicitly by ^ = a:(l + ^ + and the coefficients are not binomial coefficients, 
but the asymptotics are no harder to compute. Here, with A = s/r, we have that ficontribi_A is a 
singleton {j/a} by aperiodicity. The critical point equation is 

(l + A)2/2+Ay-(l-A) = 0. 

The solution is 

V4 - 3A2 - A 

2(1 +A) ■ 

The minimal polynomial for a'^{(j);y\) is found as in earlier sections to be 

35^ + (BA^ + 12A - 2)5' + 3A‘‘ - 24:X^ + bSA^ - 68A + 24. 

This polynomial has two positive solutions for 5 for each given A. The correct one is found by noting 
that approaches 0 as A approaches 0. 


Schroder paths: Here E = {(1,1), (2,0), (1,-1)}. We have C{x) = x, Q{x, y) = y — xy"^ — x'^y — 
X, and p and ^ are given by (1 — ± a/1 — 6a;^ + x'^)/{2x). This time Lagrange inversion does 

not obviously apply. We perform an explicit computation, noting the periodicity of F. We have 
fidzr(a:, y) =v with 

^ s \/l — 6a:^ + x"^ 
r 1 + a;^ 

Then A decreases from 1 to 0 as a; increases from 0 to the smaller positive root of 1 — 6a:^ + a:^, 
namely a/2 — 1. We also have 

2 3 + A^ - 2a/(2 + 2A2) 

"" “ r^A2 ■ 

Choosing the positive value of a;, we see that asymptotics are given by 

Qrs ~ 2Cx~^y~^s~^^‘^ 

where y = l/^{a;), when r + s is even, and 0 otherwise. Any particular diagonal (with a value of A 
between 0 and 1) can be extracted easily. For example, with A = 1/3, we obtain 03^,5 ~ 2C7'*s“^/^, 
with a: = (3-75)72,2/ = (l + 75)/2, 7 = (ll + 575)/2 « 11.09016992, and C « 0.1526195310. For 
s = 12 this approximation underestimates by about 2.9%, and for s = 24 by about 1.5%. 
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7.4. Pebble configurations. Chung, Graham, Morrison and Odlyzlo CGM09 !t| consider the fol¬ 
lowing problem. Pebbles are placed on the nonnegative integer points of the plane. The pebble 
at (i, j) may be replaced by two pebbles, one at {i + l,j) and one at {i,j -\- 1), provided this does 
not cause two pebbles to occupy the same point. Starting from a single pebble at the origin, it is 
known to be impossibl e to move all pebbles to infinity; in fact it is impossible to clear the region 
{(i, j) : 1 < 7-|-j < 2} CGM095 . Lemma 2]. 


They consider the problem of enumerating minimal unavoidable configurations. More specifically, 
say that a set T is a minimal unavoidable configuration with respect to some starting configu¬ 
ration S if it is impossible starting from S to move all pebbles off of T, but pebbles may be cleared 
from any proper subset of T. Let St denote the starting configuration where {i,j) is occupied if and 
only if z -I- j = t. Let ft{k) denote the number of sets in the region {(z, j) '■ i, j > ^ + j > t + 1} 

that are minimal unavoidable configurations for the starting configuration St- 


They derive the recurrence 


ft{k) = ft-i{k) + 2ft{k - 1) + ft+i{k - 2) 


which holds whenever t > 3 and k >2. Let 

F{x,y) = ft{k)x^y*. 

i,/c>0 

According to the kernel method, we will have F = rj/Q for some rj vanishing on the zero set of Q 
near the origin, where Q{x,y) = x — {x + y)^. Using some more identities, Chung et al. are able 
to evaluate F explicitly. They state that they are primarily interested in fo{k), so they specialize 
to F(x, 0) and compute the univariate asymptotics. It seems to us that the values ft{k) are of 
comparable interest, and we pursue asymptotics of the full generating function. 

The formula for F is cumbersome, but its principal features are (i) a denominator oi P ■ Q where 
P is the univariate polynomial 1 — 7x -I- \Ax^ — 9a;^, and (ii) F is algebraic of degree 2 and is in 
C[x, y]Wl — Ax\. The minimum modulus root of P is xo ~ 0.2410859.... The algebraic singularity 
of F occurs along the line x = 1/4, so conveniently, the branching is completely outside the closure of 
the domain of convergence and the meromorphicity assumption of the remark following Theorem l3.16l 
is satisfied. 

The boundary of the domain of convergence in the first quadrant is composed of pieces of two 
curves, namely a; = ccq and y = (x -\- yY ■ These intersect at the point (xo,yo) where yg = (1 — 
2x + -v/1 — Ax)l2 « 0.3533286. We are in the combinatorial case, so we know that minimal points 
will be found along these curves. Along the curve y = {x + j/)^ the direction ^dir(x, y) is given by 
X = r/s = (2 — 2^)f {2^ — 1 ). As {x, y) travels from (0,1) down the curve to (a;o, z/o), A increases 
from 0 to Ao « 4.295798.... At the point (xg,yg), the cone K(a;o,z/o) is the convex hull of the 
positive a;-direction the direction (Aq, 1). 
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Figure 13. the domain of convergence and the algebraic singularity 


We then have two sorts of asymptotics for ft{k). When t < fc/Ao, the asymptotics are given by 
Corollary 13.251 In this case we may evaluate G{xo,yo) « 0.00154376 and ~ 0.02925688 

so that 

/t(fc) ^ with C = 0.05276 ... 


uniformly as t/k varies over compact subsets of (0, I/Aq). It is interesti ng to comp are to the asymp¬ 
totics for /o(fc). Setting ?/ = 0 gives the univariate generating function |CGM095 | 

I (1 — 4a:)^/^(l — 3a; -|- x^) — 1 + bx — x^ — Qx^ 


f{x) = Y. 


QnX = X 


P{a 


We may compute 


and we find that 


lim QuXq = lim (xq — x)f{x) 

n^oo X—*xq 


fo{k) cXq ^ with c = 0.016762 .... 

In fact, one may calculate lim/t(fc)xg for any fixed t by computing 

(7.4) F(x,0) 

and again computing c{t) := lima;^^;;, (xq — x)ft{x). The pole at x = xq in F is removable: replacing 
the denominator PQ of F by {P/{x — xq))Q we see that g{y) := lim.x^xo{xo — x)F{x,y) has a simple 
pole a,t y = yo and that limt^oo i/Qc(t) = limy^y(j(i/o — y)g{y) is equal to C. In other words, we 
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see that the asymptotics known to hold uniformly as t/k varies over compact subsets of (0,1/Ao) 
actually hold over [0,1/Ao) as long as t —> oo, while for fc ^ oo with t fixed we use (|7.4I1 . 


On the other hand, when t/k > I/Aq we may solve for x and y to get 

/ k{2t + k) {2t + k)^ 


ficontribr = {x,y) := 


V(2t + 2r)2 ’ {2t + 2ky 


We now use Theorem 13. 191 to see that 

Mk)c{i)t-^/^ 


(2t + 2fc)2‘+2fc 
k^(2t + A:)2*+fc ■ 


The asymptotics in this case appear similar to those for the binomial coefficient • As opposed 

to the situation with Dyck paths, one may check that /t(fc) is not equal to a binomial coefficient. 


8. Discussion of other methods 


8.1. GF-sequence methods. The bench mark w ork in the area of multivariate asymptotics is still 


the 1983 article of Bender and Richmond 
rem 


BR,83l | . Their main result is a local central limit theo- 


BR83l Corollary 2] with the exact same conclusion as Theorem 13.291 Their hypotheses are: 


(i) Or > 0; 

(ii) F has an algebraic singularity of order q ^ {0,-1,—2,...} on the graph of a function 

Zd = g{zi,..., Zd-i)-, 

(iii) F is analytic and bounded in a larger polydisk, if one excludes a neighborhood of Im {zj) = 0 
for each j ; 

(iv) B is nonsingular. 


Comparing this to the results presented herein, we find both methodological and phenomenological 
differences. One methodological difference is that they view a d-variate generating function F as 
a sequence {Fn} of {d — l)-varia te gene rating functions. Their main result on coefficients of F is 
derived as a corollary of a result |BR83l . Theorem 2] on sequences satisfying F„ ~ Cng/i" for some 
appropriately smooth g and h. As we have remarked, this approach is natural for some but not all 
applications, and leads to some asymmetry in the hypotheses and conclusions. 


A more important methodological difference is that while we always work in the analytic category, 
Bender and Richmond use a blend of analytic and smooth techniquej^. This manifests itself in the 
hypotheses: where we require meromorphicity in a slightly enlarged polydisk, they require that the 
function g be in , that the residue (1 — z/g^F be in (7°, and that F be analytic away from the 
real coordinate planes. While our hypotheses are stronger in this regard, we know of no applications 
where their assumptions hold without (1 — z/g^F being analytic. Bender and Richmond gain 
generality by allowing q to be nonintegral. This is further exploited by Gao and Richmond, where 


®Analyticity is used only once, when they rotate the quadratic form B. 
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the singularity is allowed to be algebraico-logarithmic GR,92l . Corollary 3]. On the other hand, 
their methods entail estimates and therefore cannot handle cancellation, such as occurs when Or 
have mixed signs and the dominant singularity lies beyond the domain of convergence of F (see 
subsections 14.101 and 14.31) . 


Phenomenologically, there is a significant difference in generality between our methods and those 
of Bender, Richmond, Gao, et al. Their results govern only the case where a local central limit 
theorem holds - indeed it seems they are interested mainly or only in this case. Other behaviors of 
interest, which have been analyzed in the literature by various means, include Airy-type limits (see 
Section lOD . polynomial growth (see Section 14.111) . and elliptic-type limits. Central limit behavior 
results from smooth points z(r) with nondegenerate quadratic approximations to /ir(z(r)), while 
Airy-type limits result from degenerate quadratic approximations, polynomial growth or corrections 
result when z(r) is a multiple point, and elliptic-type limits result from bad points. By restricting 
our exposition to the simplest cases, we have stayed mainly within the smooth point case, where 
{or} obeys a LCLT and the methods of Bender et al. apply, in most cases equally well as the 
results in Section 13.31 But the advantage of multivariate analytic methods is that they can in 
principle be applied to any case in which F is meromorphic or algebraic. Thus, in addition to the 
further generality covered i n Section 13.41 o f this paper, the same general method has been used to 
produce Airy limit results 


PW02 


L4a03|, and is being applied to algebraic generating functions 


and meromorphic functions with bad point singularities, the simplest of which are quadratic cones. 
While this work is not yet published, it appears that it will prov ide another proof of the elliptic 
limit results for tiling statistics on the Aztec Diamond GEP96l | that is capable of generalizing 
to any quadratic cone singularity. This would prove similar behavior in two cases where such 
behavior is only conjectured, namely cube groves PSOsI ] and quantum random walks (Chris Moore, 
personal communication), as well as unifying these results with the analyses of the coefficient s of the 
generating function 1/(1 — x — y — z Axyz) that arise in the study of super ballot numbers [Ges92 | 
and Laguerre polynomials [GRZ83|. 


8.2. The diagonal method. There is a third method for obtaining multivariate asymptotics that 
deserves mention, namely the so-called “diagonal method”. This derives a univariate generating 
function f{z) = main diagonal of a bivariate generating function F{x,y) = 

J2rs^rsX^y^ ■ The asymptotics of may then be read off by standard univariate means from 
the function /. The method may be adapted to compute a generating function for the coefficients 
(inp,nq along any line of rat ional slope. T his method, long known in various literatu res, ent ered the 
combinatorics literature in 


Pur67; 


HK71| : our exposition is taken from [Section 6.3][Sta99|. 


While this elegant method produces an actual generating function, which is more informative 
than the diagonal asymptotics, its scope is quite limited. First, while asymptotics in any rational 
direction may be obtained, the complexity of the computation of anp,nq increases with p and q. 
Thus there is no continuity of complexity, and no way to obtain uniform asymptotics or asymptotics 
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in irrational directions. Secondly, the result is strictly bivariate. Thirdly, even when the diagonal 
method may be applied, the computation is typically very unwieldy. 


As an example, consider the generating function 


I — X — y + xy — x^y 


_ t . 2,,2 


which MZS04| enumerates binary words without zig-zags (a zigzag is defined to be a subword 010 or 
101 — the terminology comes from the usual correspondence of such words with Dyck paths, where 
0,1 respectively correspond to the steps (1,1), (1, —1)). Here m, n respectively denote the number 
of O’s and I’s in the word. The main diagonal enumerates zigzag-free words with an equal number 
of O’s and I’s. The solutions to H = 0, xH^ = yHy are given hy x = y = 1/^, ^ = (1 -I- •\/5)/2, and 
X = y = {1 + Thus ficontrib is a singleton {(!/</>, !/</>)} and the first order asymptotic is 

readily computed to be 

2 




The computation for any other diagonal is analogous, with the same amount of computational 
effort, and the asymptotics are uniform over any compact subset of directions keeping away from 
the coordinate axes. 


To obtain the same result via the diagonal method requires the following steps. For each fixed t 
near 0, we compute the integral 


D{z) := ^ o„„z" 

n 


1 

2771 


f , , . dz 

/ F{z,tlz) — 
Jct ^ 


1 f l + t + t^ 

2777 -f (1 -I- t — F)z — t 


where the contour is a circle that encloses all the poles of F{z,tjz)lz satisfying z{t) ^ 0 as t —> 0. 
Since F{z,0)/z has a single simple pole at z = 0, the same is true of F{z,tfz)lz for sufficiently 
small t. In this simple example we can explicitly solve for the pole z{t) and compute its residue so 
that we obtain the result 

n I 1 + 2; -f 

^ annZ - _ 3 ^ ^2 • 

n 

In other cases, after some manipulation we obtain anuz"^ implicitly as the solution of an algebraic 
equation. We are then faced with the problem of extracting asymptotics, which can probably 
be done using univariate techniques. However, we have left the realm of meromorphic series and 
this can complicate matters. In the example above, the branching occurs outside the domain of 
convergence, and the asymptotics are controlled by the dominant pole at (the minimal zero of 
the denominator). Thus one obtains the same asymptotic as above, after some effort. 


If we want to repeat this computation with required to find all small poles 

of the function F{z‘^,t/zP). It is unlikely that these may be found explicitly, which complicates the 
task of finding which ones go to zero and computing the residues there. 
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Finally, another serious problem faced by the diagonal method is that while the diagonal of 
a rational series in d = 2 variables is always algebraic, a fact which can itself be proved by the 
diagonal met hod, in d > 3 variables, the diagonal of a rational series must be D-finite but may not 
be algebraic Lip88j| . Thus a description of the diagonal generating function is more challenging. 
For example, consider the generating function F{x, y,z) = {l — x — y — z)~^ = ^ Qrstx^y^z*, whose 
diagonal coefficient an,n,n is the multinomial coefficient • This is known not to be algebraic, 

since its asymptotic leading term Ca'^n~^ is not of the right form for an algebraic function. It 
is completely routine to derive this asymptotic using the methods of the present article, but any 
method that relies on an exact description of the diagonal will clearly require substantial extra work. 
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